The advantages of Tianyancha are as follows:
It has a large scale of 100 million users, 1,000 enterprise users, high popularity, and the sum of Baidu index among its friends!
1. There are many data subjects: as many as 159 million, and the latest ones include social organizations, Hong Kong, Macao and Taiwan. The actual number of competing products is currently 100 million, while the other one only has more than 80 million.
2. Multiple dimensions: In addition to common dimensions such as industrial and commercial information, court announcements, competitive product information, recruitment, dishonest persons, patents and trademarks. Tianyancha also has 80 dimensions including qualification certificates, software copyrights, work copyrights, WeChat public accounts, Weibo company accounts, Weibo personal accounts, Weibo accounts, social organizations, law firms, judicial auctions, Hong Kong enterprises, import and export, etc. And in the successive launches, competing products have only reached more than 50 dimensions.
3. High update frequency: dynamically allocate thousands of crawler processes to crawl data 24 hours a day, with 20,000-100,000 new additions every day, and 4-5 million updates ensuring that the entire database can be updated in about a week. All can be updated.
4. Rich data sources: In addition to crawling, we have signed exclusive cooperation with many data source companies because of the leading industry in the industry, and obtained data sources from official or industry leading companies. For example, in addition to industrial and commercial sources of administrative penalties, There are more than 200 types of qualification certificates in Credit China. Listed company information data is directly exchanged with the Great Wisdom database for 800 data tables. The trademark information database is directly from the Trademark Office in cooperation with Master Quan. Case information is directly cooperated with Peking University Magic, etc., and is updated simultaneously. .
5. Leading data collection technology and cleaning technology: independently developed with patents, self-learning decoding, massive (18 million) IPs to circumvent firewalls, and simulate user login behavior to obtain data.
6. Leading data storage technology: Large-scale associated data traceable spatio-temporal relationship network TSTN includes graph storage and graph analysis technology. The number of entities and relationships is billions, and the number of entities and relationships is There are tens of billions of attributes, ensuring effective storage and fast graph-based correlation queries.
7. Leading data analysis technology: independently developed patented technology, testing the same correlation analysis algorithm in three data storage systems (traditional relational databases represented by Oracle and MySQL, and Hadoop and HBase). Comparing the performance of the representative key-value pair storage solution with the TSTN system of Tianyancha, the time is shortened from 28 hours to less than 3 seconds.
8. Leading data synchronization technology: Linked data and traditional data synchronization update technology, independently developed patented technology, do not compete with traditional macro big data based on statistics, but include both macro big data , more prominently the unity of micro-big data.
9. Location advantage: Tianyancha company is headquartered in Beijing, with convenient after-sales follow-up and communication on technical issues. It is also convenient to communicate with industry leading partners and can visit each other at home.
10. Brand advantages: Tianyancha is legal and compliant, has a large market share, the number of users has reached 100 million, abundant funds, low-priced and high-quality products, and Baidu’s popularity is twice that of the two competing products combined. times, the brand value is high, and long-term and stable cooperation can be achieved.