Oct 26, 2021
System for automatic crawling and parsing sites
Completed

$10,000+
2-3 months
United States
2-5
Service categories
Service Lines
Software Development
Ecommerce
IT Services
Domain focus
Business Services
Commerce
Technology
Programming language
Java
JavaScript
TypeScript
Frameworks
Angular.js
React.js
Spring

Challenge

The client needed a system that automatically crawls and parses websites with differing structures. It had to support many languages, using the Google Translate API to translate the collected data into English.

Solution

The system is mainly used to parse HTML, but it can also extract text from PDF, DOC, and other attached files. It can recheck sites for updates at a set interval and ignore unnecessary content.
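
One way the update check and content filtering described above could work is sketched below in Java, the project's listed language. This is an illustration under assumptions, not the project's actual code: the class name ChangeDetector, its method names, the list of ignored tags (script, style, nav, footer), and the use of a SHA-256 fingerprint are all hypothetical. The regex-based tag stripping is deliberately crude; a production crawler would more likely use a real HTML parser.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: detects whether the relevant text of a page has changed.
final class ChangeDetector {
    // Last fingerprint seen per URL (in memory only; a real system would persist it).
    private final Map<String, String> lastSeen = new HashMap<>();

    // Drops blocks assumed to be "unnecessary content", strips the remaining
    // tags, and collapses whitespace so layout-only edits are ignored.
    static String relevantText(String html) {
        return html
            .replaceAll("(?is)<(script|style|nav|footer)\\b.*?</\\1\\s*>", " ")
            .replaceAll("(?s)<[^>]*>", " ")
            .replaceAll("\\s+", " ")
            .trim();
    }

    // Hex-encoded SHA-256 digest of the given text.
    static String fingerprint(String text) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] hash = digest.digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : hash) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }

    // Returns true on the first check of a URL, or when the relevant text
    // differs from the previous check; records the new fingerprint either way.
    boolean hasChanged(String url, String html) {
        String current = fingerprint(relevantText(html));
        String previous = lastSeen.put(url, current);
        return !current.equals(previous);
    }
}
```

In this sketch, a scheduler such as java.util.concurrent.ScheduledExecutorService could fetch each page at a fixed interval and pass the HTML to hasChanged, re-parsing the page only when it returns true.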

Results

The system creates posts for its own resources from the collected data. All posts are sorted by category and available to users.
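
The category sorting mentioned above might look roughly like the following Java sketch. The PostCatalog and Post types and their fields are hypothetical and used only for illustration; the source does not describe the actual data model.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical model of generated posts grouped by category.
final class PostCatalog {
    static final class Post {
        final String title;
        final String category;

        Post(String title, String category) {
            this.title = title;
            this.category = category;
        }

        String title() { return title; }
        String category() { return category; }
    }

    // Groups post titles by category; the TreeMap keeps categories in
    // alphabetical order, and titles keep their original order within each.
    static Map<String, List<String>> titlesByCategory(List<Post> posts) {
        return posts.stream().collect(Collectors.groupingBy(
            Post::category,
            TreeMap::new,
            Collectors.mapping(Post::title, Collectors.toList())));
    }
}
```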