Today we are going to learn about the new releases from Scikit-learn version 0.22, a machine learning library in Python. We, through this video tutorial, aim to learn about the much talked about new release called Plotting API. Prior to this version, Scikit-learn did not have a function to plot the ROC curve.
A new plotting API is available for creating visualizations. The new API allows for quickly adjusting the visuals of a plot without involving any recomputation. It is also possible to add different plots to the same figure. In this tutorial we are going to study the plotting of the ROC curve.
The code sheet is provided in a Github repository here.
We will attempt to plot the ROC curve on two different algorithms and compare which one is a better function. First we choose to make a classification data. Then we go on to plot the ROC curve using SVC classifier and then further plot the curve using a random forest classifier.
Even as the coronavirus pandemic rages on and India is living through a strict lockdown to abate the spread of the novel virus, a disastrous spell of a plague of crop destroying locusts has struck Rajasthan, Gujarat and parts of Madhya Pradesh.
Threatening to balloon into an agrarian crisis, the destruction of crops on this scale is being seen as one “worst in decades”. In fact, such large scale breeding of locusts and an attack by them is the worst in 27 years, government officials said.
In such frightening circumstances, what we can truly bank upon to detect and fight locust attacks is advanced technology like machine learning techniques. This essay aims to demystify how machine learning can be used to detect locust breeding patterns by studying soil moisture through remote sensing.
A study called “Machine learning approach to locate desert locust breeding areas based on ESA CCI soil moisture” shows how researchers have “used two machine learning algorithms (generalized linear model and random forest) to evaluate the link between hopper presences and SM (Soil Moisture) conditions under different time scenarios…It was found that an area becomes suitable for breeding when the minimum SM values are over 0.07 m3 / m3 during 6 days or more. These results demonstrate the possibility to identify breeding areas in Mauritania by means of SM, and the suitability of ESA (European Space Agency) CCI (Climate Change Initiative) SM product to complement or substitute current monitoring techniques based on precipitation datasets.”
The study found that “it is widely assumed” that rainfall over 25 mm in two consecutive months is conducive to locust breeding. Likewise, various soil moisture conditions affect breeding patterns greatly. So, the study finds that it is important to have “variable creation as a previous step to modeling”. Different time intervals of locust breeding were tested by the researchers for model creation. Also, different soil moisture values were considered.
It was found that the “highest performance was acquired by the RF (Random Forest) algorithm when dividing the whole survey time into ranges of 6 days, and selecting the minimum SM as the variable value.” GLMs of Generalised Linear Models, however, did not work well according to the study.
The applied methodology of machine learning offers promising results to accurately identify breeding areas based on data pertaining to 30 years of SM values. The ESA CCI soil moisture data is one of the most authoritative ones in the world. Thus the researchers who conducted this study are confident that their results signify a breakthrough in locust monitoring technique prevalence in the world.
This study, thus, proposes a machine learning approach based on SM time series “to predict breeding areas, by means of remote sensing”. Artificial Intelligence and Machine Learning will help future researchers and scientists to study and produce better warning systems based on the results of this study. In this study only soil moisture data has been used but more variables like temperatures can also be taken into account to accurately predict breeding grounds in the future.
The Covid-19 pandemic has struck India like it has scores of countries across the world. As of May 27, over 1,51,000 Indians have been tested positive for the novel virus and over 4000 people have died due to the contagious disease. India has been under lockdown for over two months now in an attempt at abating the spread of the virus due to movement and contact.
With all offices closed and work from home decreed across numerous sectors of the economy, professionals have been forced to adapt to a new mode of work and training. With more time on hand since they are working from home, professionals are upgrading their skills by taking up online training modules and classes. A recent LinkedIn survey throws light on this phenomenon.
LinkedIn’s Work Force Confidence Index
India’s foremost social networking site that helps individuals network with professional peers and find jobs and appointments has conducted a survey called Work Force Confidence Index. As per the survey conducted between April 27 and May 3, “India’s professionals are logging learning hours for not just knowledge acquisition but also to increase productivity. About half of respondents from mid-market firms joined courses that help them manage time better, improve prioritisation or stay organised”.
93% respondents to upskill online in next two weeks
According to LinkedIn News India, 1040 professionals were surveyed by LinkedIn and 93% of them said “their time spent on e-learning will either increase or remain the same over the next two weeks”. Moreover, 60% of the respondents of which 74% were from the engineering domain said e-learning was a conduit to furthering industry knowledge. “Advancing in one’s career was a driver for 57% of all respondents and 3 in 10 active job seekers undertook e-learning to make a career pivot,” said LinkedIn News India.
What respondents learnt
Of the respondents, 45% said they hoped to learn to collaborate with peers through online learning in lockdown. Also, 43% said they wished to learn to manage time and prioritise and stay organised. Moreover, 40% said they hoped to learn something unrelated to work through online platforms. Becoming a leader and managing personal finances were pegged at 37% and 32% respectively by the study as goals and 24% said e-learning could actually lead to a change in career paths for them.
Advantages of e-learning
Travelling to work and back is taxing and time consuming. When you are working from home, you save on energy and time that can be used for something productive like e-learning training modules. They are easy on the pocket, accessible from absolutely anywhere you are and convenient to absorb and retain information and new things learnt. Moreover, there is a large online community to help you out with study material and guidance.
The world has seen a transformation in its economic activities since the coronavirus pandemic broke out. Economies have come to a grinding halt and manufacturing has dipped. Now what nations need is resilience and strength to carry on production in all sectors. What they are most depending on is the power of Artificial Intelligence to enhance the manufacturing process and help save money and drive down costs.
Here are some examples of how AI is powering the manufacturing sector in 2020.
AI is being used to transform machinery maintenance and quality in manufacturing operations today, according to Capgemini.
Caterpillar’s Marine Division is using machine learning to analyze data on how often its shipping equipment should be cleaned helping it save thousands of dollars.
The BMW Group is using AI to study manufacturing component images in and spot deviations from the standard production procedure in real-time.
In fact, a study shows that in the four earlier global economic downturns companies using AI were actually successful in increasing both sales and profit margins. Companies are all striving to utilize human experience, insights and AI techniques to give manufacturing a fillip in these times of a crisis.
Manufacturing using AI in real-time
Real-time monitoring of the manufacturing process is advantageous because it translates to sorting out production bottlenecks, tracking scrap rates and meeting customer deadlines among other things. The huge cache of data used can be utilized to build machine learning models.
Supervised and unsupervised machine learning algorithms can study multiple production shifts’ real-time data within seconds and predict processes, products, and workflow patterns that were not known before. A report suggests 29% of AI implementations in manufacturing are for maintaining machinery and production assets.
It was found that the most popular use of AI in manufacturing is predicting when equipment are likely to fail and suggesting optimal times to conduct maintenance. Companies like General Motors analyze images of its robots from cameras mounted above to spot anomalies and possible failures in the production line and thus preempt outages.
General Motors uses AI algorithms to give and produce optimized product design. General Motors can achieve the goal of rapid prototyping with the help of AI and ML algorithms. Designers provide definitions of the functional needs, raw materials, manufacturing methods and other constraints and the company along with AutoDesk has customized Dreamcatcher to optimize for weight and other vital criterion. In this way, AI comes together with human endeavor to produce a-class product designs that cost lesser.
Nokia has begun using a video application that takes the help of machine learning to alert an assembly operator if there are inconsistencies in the production process in one of its factories in Oulu, Finland. It alerts a machine operator about inconsistencies in the production of electronic items and this helps preempt poor production process and helps the company save on a lot of money and capital.
There are many other production processes AI is helping revolutionize. Only time will tell how much of AI will power the manufacturing sector. But this technological advancement is surely making an impact on economies worldwide. Meanwhile, for more details, do peruse the DexLab Analytics website. DexLab Analytics is a premiere machine learning institute in Gurgaon.
Today we are going to learn about the new releases from Scikit-learn version 0.22, a machine learning library in Python. First we learn how to install it on our systems. Then, we come to the much talked about new release called stacking regression.
Now, how does stacking regression work? Well, you have been using machine learning algorithms like Decision Tree or Random Forest. Have you heard of Voter Classifier? It is an algorithm in Scikit-learn. Ensemble algorithm is a combination of two or more algorithms to make it stronger.
When working on a set of data, we must apply all these algorithms to get predicted values. Then we vote out classified predicted values in Voter Classifier. Stacking Classifier is different. What we are doing in it is stacking together the predicted values to make a new input.
Initially, we make prediction by using various algorithms separately. Their results or output are then concatenated together. Then we use this output as a new input and apply the algorithms to it to get target variable. This method is known as stacking regression.
We try this out on a data set that can be taken from a github repository the link to which is given below.
Then we use two algorithms as estimators. Then we use stacking regression to build a model. For more on this do watch the video attached herewith. This tutorial was brought to you by DexLab Analytics. DexLab Analytics is a premiere Machine Learning institute in Gurgaon.
Chatbots or “conversational agents” are software applications that mimic or imitate written or spoken human speech for the purposes of facilitating a conversation or interaction with a human being.
These applications have become one of the most ubiquitous software applications out there with the advancement of machine learning technology and NLP.
“Today’s chatbots are smarter, more responsive, and more useful – and we’re likely to see even more of them in the coming years… chatbots are used most commonly in the customer service space, assuming roles traditionally performed by living, breathing human beings such as Tier-1 support operatives and customer satisfaction reps.”
Conversational agents are becoming a common occurrence partly due to the fact that barriers to entry in creating chatbots such as sophisticated programming knowledge have become redundant.
How Chatbots work
The crux of chatbot technology is natural language processing or NLP, the same technology “that forms the basis of the voice recognition systems used by virtual assistants such as Google Now, Apple’s Siri, and Microsoft’s Cortana.” “Chatbots process the text presented to them by the user…infer what they mean and/or want, and determine a series of appropriate responses based on this information.”
Here are 5 companies using chatbots for various roles like marketing, communicating with marginalized groups and patients suffering from sleeplessness and memory loss.
Russian technology company Endurance developed a companion chatbot to help dementia patients cope with decreased verbal ability. Many patients with Alzheimer’s disease use the chatbot to converse with. In turn, the chatbot identifies deviations in conversational patterns of the patient that might indicate a problem with memory and recollection.
Casper’s Insomnobot 3000 is a conversational agent that aims to help insomniacs by posing as a companion to talk to while the rest of the world sleeps. However, at this point, “Insomnobot 3000 is a little rudimentary.”
International child advocacy nonprofit UNICEF is using chatbots to help people living in developing countries speak out about the most urgent needs in their communities. The bot, named U-Report, focuses on large-scale data gathering via polls. UNICEF then uses feedback as the basis for potential policy recommendations.
This chatbot aims at making medical diagnoses faster, easier, and more transparent for both patients and physicians. MedWhat is powered by a highly sophisticated machine learning system that offers increasingly accurate responses to user questions based on behaviors that it “learns” by interacting with human beings. Also, it acts as a repository of a vast source of medical journals and medical advice.
Roof Ai is a chatbot that helps real-estate marketers to “automate interacting with potential leads and lead assignment via social media”. The bot identifies potential leads via social media and responds immediately, irrespective of the time of the day. “Based on user input, Roof Ai prompts potential leads to provide a little more information, before automatically assigning the lead to a sales agent.”
Machine Learning is an acquired knowledge science. It has to be taught and studied. For this, it is imperative to have the best books on the subject at hand. However, most books on the subject are expensive and not easily accessible. This is only fair given the amount of hard word that goes into writing these books.
In a situation this critical, it is best to rely on the good old Internet for assistance. There are some good Samaritans who have chosen to make their works freely available to all. Here is a great guide to free ebooks available online so you can brush up on your concepts and be industry ready at the earliest.
Think Stats – Probability and Statistics for Programmers by Allen B Downey
This is an introduction to statistics and probability for those who have a basic grounding in Python programming. “It’s based on a Python library for probability distributions (PMFs and CDFs). To make things easier for the reader, most of the exercises have short programs,” says a report.
Bayesian Reasoning and Machine Learning by David Barber
When it comes to Bayesian statistics, this book is a classic. “This takes a Bayesian statistics approach to machine learning.”This is a book worth checking out for anyone getting into the machine learning field and trying to make a career out of the subject.
An Introduction to Statistical Learning by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani
This popular entry is an introduction to data science through machine learning. “This book gives clear guidance on how to implement statistical and machine learning methods for newcomers to this field. It’s filled with practical real-world examples of where and how algorithms work. For those with an inclination towards R programming, this book even has practical examples in R.”
Understanding Machine Learning by ShaiShalev-Shwartz and Shai Ben-David
“This book gives a structured introduction to machine learning. It looks at the fundamental theories of machine learning and the mathematical derivations that transform these concepts into practical algorithms. Following that, it covers a list of ML algorithms, including…stochastic gradient descent, neural networks, and structured output learning.”
A Programmer’s Guide to Data Mining by Ron Zacharski
This book has chapters covering recommendation systems. “It takes a…visually entertaining look at social filtering and item-based filtering methods and how to use machine learning to implement them. Other concepts like Naive Bayes and Clustering are also covered. There is a chapter on Unstructured Text and how to deal with it, in case you are thinking about getting into Natural Language Processing. Examples in Python are also available in case you want to practice.”
In the midst of a crisis as big as COVID-19 which has as per the current update taken more than 80,000 lives around the world and caused nations to go under a lockdown, machine learning and artificial intelligence has played a vital role in detecting COVID-19 positive cases and helping fast track the tests and analysis of drugs that can be a solution to the problem.
Countries around the world are building algorithms to better understand the molecular structure of the diseaseso that the so-called protein around the SARS-CoV-2, the virus that causes COVID-19, can be blocked.
Use of AI and machine learning around the world
The legendary story of a Canada-based AI named Bluedot is not something one can forget. Buledot is a low cost AI based tool that predicted the outbreak of the disease in December 2019 and also predicted the top 20 destinations where passengers from Wuhan, the origin of pandemic, would arrive.
Deargen a South-Korean company built an AI tool named MT-DTI to test and analyze the molecular bond of the SARS-CoV-2 protein and recommend medication on the basis of the same. Even though their recommendations are yet to be reviewed and tested properly, their find might prove to be a super solution to reduce the number of death counts and covid positive cases.
Insilico Medicine, a Hong-Kong based company instead of building a platform to recommend the currently available medication to break the protein of covid-19 virus and stop it from replicating, built an AI based tool to formulate and test the potency of new and chemically advanced medication to break the chain.
Benevolent AI, a British startup based on its machine learning tool discovered six important covid-19 virus fighting compounds that can prove to be helpful in curing and limiting the spread of the disease. Out of the six compounds discovered, ‘baricitinib’, a compound used to cure rheumatoid arthritis according to the sources, proves to be the safest compound that can be tested on the covid positive volunteers.
Use of AI and machine learning in India
The spread of the covid-19 pandemic in India began by the end of January 2020 and till date more than 5000 cases have been reported. The Government of India declared a lockdown by mid-March 2020 and in the middle of all this when misinformation and rumors were rising day by day, to tackle the problem the Government of India launched its covid-19 tracking app called Aarogya Setuthat means ‘a bridge of health’.
This AI-based app uses bluetooth and the mobile location of the user and informs them if there is any person covid positive within 6 feet of their location or not. The app checks the user’s data in their own database and in case the data matches a corona positive patient then it only notifies their presence keeping their identity anonymous.
India has also launched a whatsapp chatbot and any one using the whatsapp platform can access the chatbot by simply typing “MyGov Corona Helpdesk” and get the relevant and authentic information relating to the symptoms and how they can get help from direct sources.
Apart from the above initiatives to tackle the pandemic, medical practitioners, data analysts and data scientists from around the world are trying to find ways and means and analyzing the database to capture the trends and are working day and night to find best possible cure to fight the disease.
The world has come to a grinding halt with the spread of the deadly novel coronavirus (COVID-19), one of the most contagious diseases to affect us as a people of late.
In little over three months since the virus was first detected in China’s Wuhan late last year, it has spread to more than 90 countries, infected almost a million people, and taken about48,000 lives as of Thursday, April 2, 2020.
Scientists, governments and health organizations are doing the best they can to contain and fight the disease and they are trying to take all the help they can. Including the help of technology – and Artificial Intelligence – in tracking, diagnosing cases, disinfecting areas and speeding up the hunt for a cure.
Here is a short essay on how artificial intelligence is assisting in the fight against COVID-19.“Data science and machine learning might be two of the most effective weapons we have in the fight against the coronavirus outbreak,” says a report.
Tracking the virus outbreak with machine learning
In December last year, a Canada-based artificial intelligence platform that tracks the spread of infectious diseases around the world, detected a cluster of “unusual pneumonia” cases reported in China’s Wuhan. You can read about the detection of the disease here.
The Toronto-based start-up uses Natural Language Processing or NLP and machine learning algorithms to scour tons of information on infectious diseases from sources like statements from health organizations, commercial flights and live stock health reports. It alerted its clients about the outbreak of the disease almost a week before international health organizations declared the outbreak of COVID-19 in China.
Computer Vision being used to detect coronavirus infection
As of today, authorities the world over are checking temperatures of their citizens at airports and railway stations or other crowded places through thermal guns and manually screening them for signs of fever, cough and breathing difficulties.
However, computer vision algorithms can accelerate this crucial screening process by equipping cameras with computer vision technology.
In China, a tech giant has invented a thermal scanner that uses computer vision and infrared sensors to check people’s temperatures in public places. The system invented can check up to 200 people per minute.
Another tech giant has invented a system that uses AI to detect coronavirus in chest CT scans (in 20 seconds as opposed to the 15 minutes spent by human health workers) with a 96 per cent accuracy rate. The system is reportedly being adopted by 100 hospitals in China.
AI is speeding up drug research
Developing new drugs and vaccines, in the race to win the fight against coronavirus, is proving long and tedious. Some reports suggest it can take anywhere near 12 years at a cost of billions of dollars.
However, that said, AI will help speed up the process of drug discovery to some extent. “DeepMind, the AI research lab acquired by Google in 2014, recently declared that it has used deep learning to find new information about the structure of proteins associated with COVID-19”, a process that could have taken many more months.
“Understanding protein structures can provide important clues to the coronavirus vaccine formula. DeepMind is one of several organizations engaged in the race to unlock the coronavirus vaccine. It has leveraged the result of decades of machine learning progress as well as research on protein folding.”