I will make a high-quality data collection of information (parsing, scraping, crawling) from websites. Scrape data is performed from sites with open data that are freely available and are not prohibited from distribution.
What is included:
One-time collection of up to 10,000 data from one (1) open-source website. The result can be obtained in a CSV, XLS, XML, JSON, or any other format. If you wish, I will remove the excess and add the necessary one.
What sites can I ask for? Those sites where the information is publicly available and are not protected from parsing, including online stores, content sites, catalogs, directories, sites with ads, etc.
What data can be scraped from these sites?
- Products: name, article, price, photo, description, specifications, etc. ;
- Articles: texts, photos;
- Ads: texts, price, and photos;
- Posts: texts and photos;
- Companies: address and contact details.
Additionally:
- Urgent parsing;
- Downloading Files;
- Image optimization-reduce the size by 10-80% without loss of quality;
- Importing data to your site...
P. S. It is not possible to collect data from all sites, because there may be protection against parsing, IP blocking, etc. Please check this before ordering.