If you are in the big data, data science, or BI space, you might have heard about Apache Spark....
In the previous post, we figured out how to connect MongoDB with Apache Drill and query data with SQL queries....
Not a lot of people have heard of Apache Drill. That is because Drill caters to very specific use cases,...
If you’ve worked with Spark SQL, you might have come across the concept of User Defined Functions (UDFs). As the...
There will be times when you’ll need the data in your HBase database to be brought into Apache Spark for...
Multicollinearity is a term we often come across when we're working with multiple regression models. Even though we have talked...
In most of our posts about machine learning, we've talked about overfitting and underfitting. But most of us don't yet...
Now that we know what feature selection is and how to do it, let's move our focus to validating the...
In our previous post, we discussed what feature selection is and why we need it. In this post, we're...
If you've come across a dataset in your machine learning endeavors which has more than one feature, you'd have also...