A few days back, one of my friends asked me how Flipkart manages to show him such relevant results in a push notification, so I thought I would clear that up here. You have read the title of this post, so you have probably guessed that I am going to talk about data mining. Let us break the term into "data" and "mining". Data is anything from a 1 MB image file to a 1 GB movie; it is also the transactions you make and the pages you browse on the web. Mining is the extraction of something valuable from large deposits, e.g. mining gold from sand and rock.
What is Data Mining?
Data mining is the process of extracting knowledge patterns from large amounts of data stored in data warehouses, databases or other repositories. That is why data mining is also called knowledge mining. It has a lot of applications in enterprises and big organizations. For example, an online shopping site uses data mining to show its most-shopped products to customers and increase sales. Data mining methods sit at the intersection of artificial neural networks, machine learning, statistics and databases.
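To make the shopping example concrete, here is a minimal sketch of finding the most-shopped products by simply counting purchases. The `orders` list and the `most_shopped` function are made up for illustration; I am not claiming this is how any real shopping site does it.

```python
from collections import Counter

# Hypothetical order log: each entry is one purchased product.
orders = ["phone case", "headphones", "phone case", "charger",
          "headphones", "phone case"]

def most_shopped(purchases, top_n=2):
    """Return the top_n most frequently purchased products."""
    return [product for product, _ in Counter(purchases).most_common(top_n)]

print(most_shopped(orders))
```

A real system would work on millions of orders and use far richer signals, but the idea of turning raw transactions into a ranked pattern is the same.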
The Steps involved in Data Mining are:-
1. Data Cleaning, i.e. after extracting the data, we clean it so that it is free of noise.
2. Data Integration, i.e. we integrate data from different sources.
3. Data Selection, i.e. we select the data to be mined.
4. Data Transformation, i.e. we transform the data into a form in which it can be mined.
5. Data Mining. Here we use a data mining algorithm to find patterns in the transformed data.
6. Pattern Recognition. Here we work out a model from the patterns that were found.
7. Pattern Visualization. Here we present the patterns in a visual form such as charts.
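The seven steps above can be sketched as a tiny pipeline. This is only an illustration under my own assumptions: the order records, the function names and the "count products bought together" mining step are all hypothetical, and a real system would use a proper algorithm and a real charting tool.

```python
from collections import Counter
from itertools import combinations

# Hypothetical raw records from two sources (a store and an app).
store_orders = [
    {"user": "a", "items": ["phone", "case"]},
    {"user": "b", "items": ["phone", "case", "charger"]},
    {"user": None, "items": ["phone"]},           # noisy: missing user
]
app_orders = [
    {"user": "c", "items": ["Phone", "Charger"]},
    {"user": "d", "items": []},                   # noisy: empty basket
]

def clean(records):
    # Step 1: drop noisy records (missing user or empty basket).
    return [r for r in records if r["user"] and r["items"]]

def integrate(*sources):
    # Step 2: combine records from different sources into one list.
    return [r for source in sources for r in source]

def select(records):
    # Step 3: keep only the field we want to mine.
    return [r["items"] for r in records]

def transform(baskets):
    # Step 4: normalise each basket to sorted, lowercase, unique names.
    return [sorted({item.lower() for item in basket}) for basket in baskets]

def mine(baskets):
    # Step 5: count how often each pair of products is bought together.
    pairs = Counter()
    for basket in baskets:
        pairs.update(combinations(basket, 2))
    return pairs

def evaluate(pairs, min_count=2):
    # Step 6: keep only the pairs seen at least min_count times.
    return {pair: n for pair, n in pairs.items() if n >= min_count}

def visualize(patterns):
    # Step 7: a plain-text bar chart, one line per pattern.
    return [f"{a} + {b}: {'#' * n}" for (a, b), n in sorted(patterns.items())]

records = integrate(clean(store_orders), clean(app_orders))
patterns = evaluate(mine(transform(select(records))))
chart = visualize(patterns)
print("\n".join(chart))
```

Each function maps to one step, so you can swap any of them (for example, a smarter `mine`) without touching the rest.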
So, in simple terms, we first collect and store data in data warehouses, then analyze it for patterns, and finally show those results in some visual form.
Things Required for Data Mining are:-
1. Abundant Data.
2. Powerful Computers.
3. Data Mining Algorithms.
There are many techniques for Data Mining:-
1. Artificial Neural Networks. These are models that learn from the available data.
2. Decision Trees. This structure holds a rule at every branch, and following the branches classifies a record.
3. Rule Induction. It derives if-then rules from historical data and statistics to describe patterns.
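To show what "a rule at every branch" means for a decision tree, here is a small sketch. Note the assumptions: the tree is written by hand, and the feature names (`visits_per_week`, `cart_items`), thresholds and labels are invented. A real data mining algorithm would learn these splits from data instead.

```python
# Hypothetical hand-built tree; a mining algorithm would learn these splits.
tree = {
    "feature": "visits_per_week",
    "threshold": 3,
    "below": "unlikely to buy",
    "at_or_above": {
        "feature": "cart_items",
        "threshold": 1,
        "below": "browsing",
        "at_or_above": "likely to buy",
    },
}

def classify(node, record):
    """Follow one rule per branch until a leaf (a plain label) is reached."""
    while isinstance(node, dict):
        passed = record[node["feature"]] >= node["threshold"]
        node = node["at_or_above"] if passed else node["below"]
    return node

print(classify(tree, {"visits_per_week": 5, "cart_items": 2}))
```

Every inner node is one rule and every leaf is a class, which is why decision trees are easy to read compared with a neural network.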
Data mining basically builds models, and by referring to those models we can anticipate near-future trends. Big organisations use data mining to increase their sales and to help their users in every possible way. Facebook analyses our data to show us relevant posts and recommend friends. Flipkart and Amazon look for the geographical locations where their sales are higher so that they can improve their services in those regions.
Data mining algorithms have been improved over the years to give us quick and accurate results. I hope this information helps, and finally, thank you.