Zookeeper is core component of Big Data zoo. It is distributed system which coordinates different aspects of other distributed systems…
If you need really fast response on your Big Data you should check this database – blinkdb. Some information taken from…
You put your data into fancy libraries, algorithms etc., get some results out. You see some statistical properties of your data.…
Did you ever wonder why in hadoop three is the default replication factor? I heard this question, but without answer,…
If you need fast introduction to Spark, but more advanced than “Spark is cool and you should use it!”, than…
Business Intelligence on Big Data? “That’s easy!” you think. Just connect to hadoop with Hive jdbc/odbc driver and you good…
I think not many Hortonworks (one of the leading Hadoop distributions) administrators know about Grafana (very cool tool for visualising…
I listened to http://softwareengineeringdaily.com/2016/07/18/peter-bailis-on-the-data-communitys-identity-crisis/ Once again it was worth it! Thank you SED 🙂 On this podcast you get to…
Very informative podcast http://softwareengineeringdaily.com/2015/11/11/apache-flink-with-stephan-ewen/ It’s interview with Stephan Ewen, commiter to the Flink project committer and the CTO of Data Artisans.…