Environment setup for big data analytics

This article covers the basic tools and technologies to use when taking the first steps in big data analysis.

  • Linux as the base OS
  • For basic data processing:
  • A big data analysis framework, chosen according to the type of data being analyzed. For the first-step tutorials we suggest:
    • Hadoop in a single-node cluster setup (can be downloaded pre-installed as a virtual appliance)
    • Java-based MapReduce programs
    • Pig, a high-level query language whose scripts run as MapReduce jobs
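
To give a feel for what a Java-based MapReduce program does, here is a minimal sketch of the classic word count in plain Java. It deliberately does not use the Hadoop API: the class name `WordCountSketch` and the methods `map`, `shuffle`, `reduce` and `wordCount` are illustrative assumptions that only mimic the map, shuffle and reduce phases in memory. A real Hadoop job would implement the same logic with Hadoop's own `Mapper` and `Reducer` classes and run it on the cluster.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: an in-memory imitation of the MapReduce phases,
// not the Hadoop API. All names here are hypothetical.
class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every word in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\W+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group all emitted values by key (sorted by word).
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum the counts collected for one word.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    // Driver: run map over every line, then shuffle, then reduce per key.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> entry : shuffle(pairs).entrySet()) {
            result.put(entry.getKey(), reduce(entry.getKey(), entry.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // Prints {analysis=2, big=2, data=2}
        System.out.println(wordCount(List.of("big data big analysis", "data analysis")));
    }
}
```

On a real cluster the framework performs the shuffle step itself and distributes the map and reduce calls across nodes; the sketch keeps everything in one process so it can be run and inspected without any Hadoop installation.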