1, Alibaba
Alibaba has transaction data, credit data, and more is to build the underlying architecture of data circulation, collection and sharing.
2. Huawei Huawei Cloud Service
IT integrates high-performance computing and storage capabilities, and provides a professional and stable IT infrastructure platform for big data mining and analysis. Recently, Huawei's big data storage has realized the unified management of 40PB file system.
3. Baidu (the world's largest Chinese search engine)
Baidu's advantages are embodied in massive data, user behavior data accumulated for more than ten years, natural language processing ability and cutting-edge research in the field of deep learning. Recently, Baidu officially released the big data engine, which will take the lead in foreign cooperation in traditional fields such as government, medical care, finance, retail and education.
Big data, or huge amount of data, refers to the information that involves so much data that it can't be captured, managed, processed and sorted by the current mainstream software tools to help enterprises make more active decisions within a reasonable time.
In The Age of Big Data, co-authored by Victor Meyer-Schoenberg and Kenneth Cookeye, big data means that all data are used for analysis and processing, and there is no shortcut to random analysis (sampling survey). 5V characteristics of big data (proposed by IBM): volume (mass), speed (high speed), diversity (diversity), value (low value density) and authenticity.
Gartner, a research institute of "big data", gives such a definition. "Big data" is an information asset, which needs a new processing mode to have stronger decision-making, insight and process optimization capabilities to adapt to mass, high growth rate and diversification.
The definition given by McKinsey Global Institute is that the scale of data sets far exceeds the capabilities of traditional database software tools in acquisition, storage, management and analysis, with four characteristics: massive data scale, rapid data flow, diverse data types and low value density.