The platform is not tied to a specific language, in our examples, we use a lot of Python, because it is known to be used in the remote sensing community. Next to that, we also use a lot of Java and Scala, because a lot of the more advanced API's in the Hadoop ecosystem work with these languages. We have however also done extensive testing on Hadoop with Python (pyspark) and integration of C++ or Fortran code.

The only limitations to the software that can be used in VM or on the cluster are:

  • It has to run on CentOS 7, a stable, enterprise grade, Linux distribution.
  • Commercial software that requires a license is not provided, but can be used as long as you provide your own license, and there are no technical or legal issues that prevent you from using it on a VM or cluster environment.