Diverse structured, unstructured, and semi-structured Data that were generated from the various sources need to be reduced to the
same standard for the data to be understandable and flow among diverse systems involved in processing the data.
Big Data consists of heterogeneous datasets from many sources and the datasets need to be reduced to the same format.
for systems interoperability. Some of the formatting tools include XML, AVro, JSON and Parquet.
Discus the roles XML, AVro, and JSON, which are the popular data formatting tools in Big Data standardization.
Discuss the need for Big Data standardization.
List the various tools that can be used to achieve Big Data Standardization
What is XML? What is AVro? What is JSON?
Discuss the roles of XML, AVro and JSON in Big Data formatting.
Jason & Big Data