Scikit-learn Release Notes July 2017

Scikit-learn is a popular machine learning library for Python. It provides simple and efficient tools for data mining and data analysis, including algorithms for classification, regression, and clustering, and it integrates with NumPy, SciPy, and other Python scientific libraries. Development began in 2007; the stable release listed in this document is 1.3.0 (30 June 2023).

Uploaded by levin696



scikit-learn

scikit-learn (formerly [Link] and also known as sklearn) is a free software machine learning library for the Python programming language.[3] It features various classification, regression and clustering algorithms including support-vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy. Scikit-learn is a NumFOCUS fiscally sponsored project.[4]

Original author(s): David Cournapeau
Initial release: June 2007
Stable release: 1.3.0[1] / 30 June 2023
Repository: [Link]/scikit-learn/scikit-learn (https://[Link]m/scikit-learn/scikit-learn)
Written in: Python, Cython, C and C++[2]
Operating system: Linux, macOS, Windows
Type: Library for machine learning
License: New BSD License
Website: [Link] ([Link][Link]/)

Overview
The scikit-learn project started as [Link], a Google Summer of Code project by French data scientist David Cournapeau. The name of the project stems from the notion that it is a "SciKit" (SciPy Toolkit), a separately developed and distributed third-party extension to SciPy.[5] The original codebase was later rewritten by other developers. In 2010, contributors Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort and Vincent Michel, from the French Institute for Research in Computer Science and Automation in Saclay, France, took leadership of the project and released the first public version of the library on February 1, 2010.[6] In November 2012, scikit-learn and scikit-image were described as two of the "well-maintained and popular" scikits libraries.[7] In 2019, it was noted that scikit-learn is one of the most popular machine learning libraries on GitHub.[8]

Implementation
scikit-learn is largely written in Python, and uses NumPy extensively for high-performance linear algebra and array operations. Furthermore, some core algorithms are written in Cython to improve performance. Support vector machines are implemented by a Cython wrapper around LIBSVM; logistic regression and linear support vector machines by a similar wrapper around LIBLINEAR. In such cases, extending these methods with Python may not be possible.
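
As a rough illustration of the wrappers described above, the sketch below fits the three estimators that rely on LIBSVM or LIBLINEAR. The synthetic dataset, the hyperparameters and the `scores` dictionary are assumptions made for this example only and do not come from the article.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC, LinearSVC

# Small synthetic binary classification problem (illustrative data only).
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# SVC is the kernel support vector classifier backed by LIBSVM.
svc = SVC(kernel="rbf").fit(X, y)

# LinearSVC and LogisticRegression(solver="liblinear") are backed by LIBLINEAR.
linear_svc = LinearSVC(max_iter=10000).fit(X, y)
logreg = LogisticRegression(solver="liblinear").fit(X, y)

# Training accuracy for each model, collected under a hypothetical name.
scores = {
    "SVC": svc.score(X, y),
    "LinearSVC": linear_svc.score(X, y),
    "LogisticRegression": logreg.score(X, y),
}
for name, acc in scores.items():
    print(f"{name}: training accuracy {acc:.2f}")
```

All three objects are used through the same `fit` and `score` methods, so the compiled back end stays hidden from calling code; the trade-off noted above is that the optimisation routine itself cannot be altered from Python.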

scikit-learn integrates well with many other Python libraries, such as Matplotlib and plotly for plotting,
NumPy for array vectorization, Pandas dataframes, SciPy, and many more.
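
A minimal sketch of this interoperability, assuming pandas is installed: a DataFrame is passed straight to an estimator and the predictions come back as a NumPy array. The column names and the toy relationship `y = 2*x1 + 3*x2 + 1` are invented for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical toy data held in a pandas DataFrame: y = 2*x1 + 3*x2 + 1.
rng = np.random.default_rng(0)
df = pd.DataFrame({"x1": rng.normal(size=50), "x2": rng.normal(size=50)})
df["y"] = 2.0 * df["x1"] + 3.0 * df["x2"] + 1.0

# The estimator accepts the DataFrame columns directly.
model = LinearRegression().fit(df[["x1", "x2"]], df["y"])

# Predictions and fitted parameters are NumPy objects.
pred = model.predict(df[["x1", "x2"]])
print(type(pred).__name__)
print(np.round(model.coef_, 3), round(float(model.intercept_), 3))
```

Because the data is noise-free, the fitted coefficients should recover the generating values up to floating-point error. The resulting arrays can be handed to Matplotlib or plotly for plotting, which is not shown here.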

Version history
scikit-learn was initially developed by David Cournapeau as a Google Summer of Code project in 2007.
Later that year, Matthieu Brucher joined the project and started to use it as a part of his thesis work. In
2010, INRIA, the French Institute for Research in Computer Science and Automation, got involved and the
first public release (v0.1 beta) was published in late January 2010.

August 2013. scikit-learn 0.14[9]
July 2014. scikit-learn 0.15.0[9]
March 2015. scikit-learn 0.16.0[9]
November 2015. scikit-learn 0.17.0[9]
September 2016. scikit-learn 0.18.0
July 2017. scikit-learn 0.19.0
September 2018. scikit-learn 0.20.0[10]
May 2019. scikit-learn 0.21.0[11]
December 2019. scikit-learn 0.22[12]
May 2020. scikit-learn 0.23.0[13]
January 2021. scikit-learn 0.24[14]
September 2021. scikit-learn 1.0.0[15][16]
October 2021. scikit-learn 1.0.1[17]
December 2021. scikit-learn 1.0.2[18]
May 2022. scikit-learn 1.1.0[19]
May 2022. scikit-learn 1.1.1[20]
August 2022. scikit-learn 1.1.2[21]
October 2022. scikit-learn 1.1.3[22]
December 2022. scikit-learn 1.2.0[23]
January 2023. scikit-learn 1.2.1[24]
March 2023. scikit-learn 1.2.2[25]

See also
mlpy
SpaCy
NLTK
Orange
PyTorch
TensorFlow
[Link]
List of numerical analysis software
[Link]

References
1. "Release 1.3.0" ([Link] 30 June 2023. Retrieved 1 July 2023.
2. "The scikit-learn Open Source Project on Open Hub: Languages Page" ([Link][Link]/p/scikit-learn/analyses/latest/languages_summary). Open Hub. Retrieved 14 July 2018.
3. Fabian Pedregosa; Gaël Varoquaux; Alexandre Gramfort; Vincent Michel; Bertrand Thirion; Olivier Grisel; Mathieu Blondel; Peter Prettenhofer; Ron Weiss; Vincent Dubourg; Jake Vanderplas; Alexandre Passos; David Cournapeau; Matthieu Perrot; Édouard Duchesnay (2011). "scikit-learn: Machine Learning in Python" ([Link]html). Journal of Machine Learning Research. 12: 2825–2830.
4. "NumFOCUS Sponsored Projects" ([Link] NumFOCUS. Retrieved 2021-10-25.
5. Dreijer, Janto. "scikit-learn" ([Link]
6. "About us — scikit-learn 0.20.1 documentation" ([Link]ory). [Link].
7. Eli Bressert (2012). SciPy and NumPy: an overview for developers ([Link]m/books?id=fLKTuJqQLVEC&pg=PA43). O'Reilly. p. 43.
8. "The State of the Octoverse: machine learning" ([Link]he-octoverse-machine-learning/). The GitHub Blog. GitHub. 2019-01-24. Retrieved 2019-10-17.
9. "Release history — scikit-learn 0.19.dev0 documentation" ([Link][Link]). [Link]. Retrieved 2017-02-27.
10. "Release History - 0.20.0 documentation" ([Link]sion-0-20). scikit-learn. Retrieved 6 November 2018.
11. "Release History - 0.21.0 documentation" ([Link]sion-0-21-0). scikit-learn. Retrieved 5 May 2019.
12. "Release History - 0.22 documentation" ([Link] scikit-learn. Retrieved 7 June 2020.
13. "Release History - 0.23.0 documentation" ([Link]#version-0-23-0). scikit-learn. Retrieved 7 June 2020.
14. "Release History - 0.24 documentation" ([Link] scikit-learn. Retrieved 2021-02-08.
15. "Release History - 1.0.0 documentation" ([Link]rsion-1-0-0). scikit-learn.
16. "Release History - 1.0.0 documentation" ([Link]rsion-1-0-0). scikit-learn.
17. "Release History - 1.0.1 documentation" ([Link]rsion-1-0-1). scikit-learn.
18. "Release History - 1.0.2 documentation" ([Link] scikit-learn.
19. "Release History - 1.1.0 documentation" ([Link]rsion-1-1-0). scikit-learn.
20. "Release History - 1.1.1 documentation" ([Link]rsion-1-1-1). scikit-learn.
21. "Release History - 1.1.2 documentation" ([Link]rsion-1-1-2). scikit-learn.
22. "Release History - 1.1.3 documentation" ([Link] scikit-learn.
23. "Release History - 1.2.0 documentation" ([Link]rsion-1-2-0). scikit-learn.
24. "Release History - 1.2.1 documentation" ([Link]rsion-1-2-1). scikit-learn.
25. "Release History - 1.2.2 documentation" ([Link] scikit-learn.

External links
Official website ([Link]
scikit-learn ([Link] on GitHub


Common questions

Scikit-learn began in 2007 as a Google Summer of Code project, initially developed by David Cournapeau. Contributors from INRIA took leadership of the project in 2010, and the library has since gone through many releases with contributions from community developers and institutional support such as fiscal sponsorship from NumFOCUS. This sustained development and community engagement underpin its wide adoption.

Scikit-learn is designed to interoperate with NumPy and SciPy, and it works well with other Python libraries such as Matplotlib and Pandas. In practice this means data can be prepared, passed to a model, evaluated, and visualized without converting between incompatible formats.

As a NumFOCUS fiscally sponsored project, scikit-learn receives financial oversight and organizational support, which aids its long-term sustainability. Such sponsorship can help fund development activities, infrastructure, and community events, and it lends the project credibility.

Scikit-learn is released under the New BSD License, which permits free use, distribution, and modification. This permissive license lowers the barrier to using the library in both academic research and commercial applications, including integration into proprietary software.

Scikit-learn is largely written in Python, which keeps the code readable and quick to develop. Some core algorithms are written in Cython to improve performance, and Cython wrappers connect the library to LIBSVM (support vector machines) and LIBLINEAR (logistic regression and linear support vector machines). The trade-off is that Cython code is harder to write and modify than pure Python, while pure Python can be slow for computationally intensive tasks.

Scikit-learn runs on Linux, macOS, and Windows, so users can develop and run machine learning applications on whichever platform they prefer. This cross-platform support also eases collaboration across different environments in both academic research and commercial development.

INRIA, the French Institute for Research in Computer Science and Automation, became involved in 2010. Contributors from INRIA took leadership of the project and released the library's first public version that year, which gave the project structured development and greater visibility in academic and industrial communities.

Extending scikit-learn is harder where performance-critical components are written in Cython, C, or C++. For methods implemented as wrappers around LIBSVM and LIBLINEAR, extending them with Python may not be possible. In return, these compiled components provide faster computation and integrate with existing scientific libraries.

David Cournapeau started scikit-learn as a Google Summer of Code project in 2007. Later that year, Matthieu Brucher joined the project and began using it as part of his thesis work. The original codebase was later rewritten by other developers, but these early contributions established the project that the later community built on.

Scikit-learn includes a wide range of algorithms, such as support-vector machines and random forests, that cover classification, regression, and clustering tasks. Practitioners can apply these established techniques without implementing them from scratch, which speeds up the development of predictive models.
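
To make the range of algorithms concrete, the sketch below applies a random forest classifier and two clustering algorithms named in the article (k-means and DBSCAN) to the same data. The synthetic dataset and every hyperparameter value are assumptions chosen for illustration, not recommendations from the source.

```python
from sklearn.cluster import DBSCAN, KMeans
from sklearn.datasets import make_blobs
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Three synthetic clusters with known labels (illustrative data only).
X, y = make_blobs(n_samples=300, centers=3, cluster_std=0.5, random_state=0)

# Supervised: a random forest classifier evaluated on a held-out split.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
forest = RandomForestClassifier(n_estimators=50, random_state=0)
forest.fit(X_train, y_train)
accuracy = forest.score(X_test, y_test)

# Unsupervised: k-means is told the number of clusters, while DBSCAN
# infers it from density (label -1 marks points treated as noise).
kmeans_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
dbscan_labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X)

print(f"random forest test accuracy: {accuracy:.2f}")
print(f"k-means clusters: {len(set(kmeans_labels))}")
print(f"DBSCAN clusters (excluding noise): {len(set(dbscan_labels) - {-1})}")
```

The number of clusters DBSCAN reports depends on `eps` and `min_samples`, so those two values usually need tuning for real data.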

Common questions

In what ways has scikit-learn evolved since its inception in 2007, and how has its development been supported by various contributors and institutions?

How does scikit-learn facilitate integration with other Python libraries and what advantage does this provide for machine learning tasks?

Analyze the significance of scikit-learn being a fiscally sponsored project by NumFOCUS and how this arrangement might affect its development and sustainability.

Assess the impact of scikit-learn’s licensing under the New BSD License on its adoption within the academic and commercial sectors.

Compare and contrast the usage of Cython and pure Python in scikit-learn's implementation of machine learning algorithms. What are the benefits and limitations of each?

Evaluate how scikit-learn’s compatibility with operating systems like Linux, macOS, and Windows benefits its user base.

What role did the French Institute for Research in Computer Science and Automation (INRIA) play in the evolution of scikit-learn?

Discuss the challenges and advantages of extending scikit-learn's methods given its foundational languages including Python, Cython, C, and C++.

What contributions have individual developers like David Cournapeau and Matthieu Brucher made to the early development of scikit-learn, and how have these contributions shaped its evolution?

How have scikit-learn’s features like support-vector machines and random forests enhanced its utility in machine learning projects?
