Difference between Pandas .iloc and .loc function

The optimized data access methods are accessed by indexing off of the .loc and .iloc attributes. These two attributes allow label-based and position-based indexing respectively. When we perform an index operation on the .iloc attribute, it does lookup based on index position (in this case pandas behaves similar to a Python list). DataFrame operation: .loc…

Read More

Hierarchical clustering for Machine Learning

Hierarchical clustering is another unsupervised machine learning algorithm, which is used to group the unlabeled datasets into a cluster. Hierarchical Clustering creates clusters in a hierarchical tree-like structure (also called a Dendogram) as it creates a subset of similar data in a tree-like structure in which the root node corresponds to the entire data, and…

Read More

Difference between Data Science and Machine Learning

Data Science Data science is a field that studies data and how to extract meaning from it, using a series of methods, algorithms, systems, and tools to extract insights from structured and unstructured and unstructured data. That knowledge then gets applied to business, government, and other bodies to help drive profits, innovate products and services…

Read More

Difference between Data Scientist and Data Analyst

What are their skills? Data Analyst Data Mining Data Warehousing Math, Statistics Tableau and data visualization SQL Business Intelligence Advanced Excel skills Data Scientist Data Mining Data Warehousing Math, Statistics, Computer Science Tableau and Data Visualization/Storytelling Python, R, JAVA, Scala, SQL, Matlab, Pig Economics Big Data/Hadoop Machine Learning Educational requirements Data Analyst Foundational math, statistics…

Read More

Difference between Data Scientist and Data Engineer

What do they do? Data Engineers Data Engineers design, build, test, integrate, and optimize data collected from multiple sources. They use Big Data tools and technologies to construct free-flowing data pipelines that facilitate real-time analytics applications on complex data. Data Engineers also write complex queries to improve data accessibility. Data Scientist Data Scientists are more…

Read More

Difference between Big Data and Data Science

Big Data Hugh volumes of data which cannot be handled using traditional database programming. Characterized by volume, variety, and velocity. Data Science A data-focused on scientific activity. Approaches to process big data. Harnesses the potential of big data for business decisions. Similar to data mining. Concept Big Data Diverse data types generated from multiple data…

Read More

Credit Card Fraud Detection using Machine Learning

As we’re moving towards the digital world — cybersecurity is getting a critical part of our life. When we talk about security in digital life also the main challenge is to find the abnormal activity. When we make any transaction while buying any product online — a good amount of people prefer credit cards. The…

Read More

K-Means algorithm for Machine Learning

K-Means Clustering is an Unsupervised Learning algorithm, which groups the unlabeled dataset into different clusters. It allows us to cluster the info into different groups and a convenient way to discover the categories of groups in the unlabeled dataset on its own without the need for any training. The k-means clustering algorithm mainly performs two…

Read More

DBSCAN algorithm for Machine Learning

Density-based special clustering of applications with noise or DBSCAN is a density-based clustering method that calculates how dense the neighborhood of a data point is. the main idea behind DBSCAN is that a point belongs to a cluster if it is close to many from that cluster. It will measure the similarity between data points,…

Read More

Default argument and Ternary operators in Python

The objects like list, dict are mutable. A mutable object can change its state or contents, so whenever we use these types of mutable objects as default argument in Python functions, and execute or call the function multiple times it gives unnecessary output for each function call. The example shown below is to return the…

Read More