Posts

Showing posts with the label Python

Pandas - Series

Image
Introduction Python is a really powerful language and any piece of software like web application, windows application, ML model etc  can be built using Python. Prior to Pandas, Python was majorly used for data munging and preparation. It had a very little contribution towards data analysis. Pandas solved this problem. Using Pandas, we can accomplish five typical steps in the processing and analysis of data, regardless of the origin of data — load, prepare, manipulate, model, and analyze. Python with Pandas is used in a wide range of fields including academic and commercial domains including finance, economics, Statistics, analytics, etc. This blog provides code for basic usage of Series Data Type in Pandas. Data types in Pandas Series : 1D,labeled homogeneous array, size immutable. Data Frames : General 2D labeled, size-mutable tabular structure with potentially heterogeneously typed columns Panel : General 3D labeled, size-mutable array. Seri

Machine Learning -Solution or Problem

Image
The article will be divided into different sections as follows: Introduction to Machine Learning Types of Solutions Classification using Naive Bayes A brief about Machine Learning According to the definition by Wikipedia,  Machine learning  is the subfield of  computer science  that, according to  Arthur Samuel  in 1959, gives "computers the ability to learn without being explicitly programmed."  Machine Learning defines a set of problems that have to be evolved through the data by implying some algorithm. One factor that has to be kept in mind while defining a solution through ML is accuracy. Accuracy is very critical in case you are developing a solution in medical domain(cancer detection).There should be a threshold set for every solution which can be based on risk %age that is acceptable. A useful cheatsheet from Microsoft's site to sum up the use of different ML algorithms for the different type of problems. Types of solution Machine Lear