If you are in the big data or data science or BI space, you might have heard about Apache Spark....
Data Science
In the previous post, we figured out how to connect MongoDB with Apache Drill and query data with SQL queries....
Not a lot of people have heard of Apache Drill. That is because Drill caters to very specific use cases,...
If you’ve worked with Spark SQL, you might have come across the concept of User Defined Functions (UDFs). As the...
A couple of days back, we saw how we can connect Apache Spark to an Apache HBase database and query...
There will be times when you’ll need the data in your HBase database to be brought into Apache Spark for...
Multicollinearity is a term we often come across when we're working with multiple regression models. Even though we have talked...
In most of our posts about machine learning, we've talked about overfitting and underfitting. But most of us don't yet...
Now that we know what is feature selection and how to do it, let's move our focus to validating the...
In our previous post, we discussed what is feature selection and why we need feature selection. In this post, we're...