RapidMiner is a data science software platform developed by the company of the same name that provides an integrated environment for data preparation, machine learning, deep learning, text mining, and predictive analytics. It is used for business and commercial applications as well as for research, education, training, rapid prototyping, and application development and supports all steps of the machine learning process including data preparation, results visualization, model validation and optimization. RapidMiner is developed on an open core model. The RapidMiner Studio Free Edition, which is limited to 1 logical processor and 10,000 data rows is available under the AGPL license, by depending on various non-opensource components. Commercial pricing starts at $5,000 and is available from the developer.
9.4 / 26 September 2019
|Type||Data science, machine learning, predictive analytics|
|License||Free, Professional and Enterprise Editions are Proprietary; Free Edition (10,000 rows and 1 logical processor limit) is available as AGPL|
RapidMiner, formerly known as YALE (Yet Another Learning Environment), was developed starting in 2001 by Ralf Klinkenberg, Ingo Mierswa, and Simon Fischer at the Artificial Intelligence Unit of the Technical University of Dortmund. Starting in 2006, its development was driven by Rapid-I, a company founded by Ingo Mierswa and Ralf Klinkenberg in the same year. In 2007, the name of the software was changed from YALE to RapidMiner. In 2013, the company rebranded from Rapid-I to RapidMiner.
According to Bloor Research, RapidMiner provides 99% of an advanced analytical solution through template-based frameworks that speed delivery and reduce errors by nearly eliminating the need to write code. RapidMiner provides data mining and machine learning procedures including: data loading and transformation (ETL), data preprocessing and visualization, predictive analytics and statistical modeling, evaluation, and deployment. RapidMiner is written in the Java programming language. RapidMiner provides a GUI to design and execute analytical workflows. Those workflows are called “Processes” in RapidMiner and they consist of multiple “Operators”. Each operator performs a single task within the process, and the output of each operator forms the input of the next one. Alternatively, the engine can be called from other programs or used as an API. Individual functions can be called from the command line. RapidMiner provides learning schemes, models and algorithms and can be extended using R and Python scripts.
RapidMiner functionality can be extended with additional plugins which are made available via RapidMiner Marketplace. The RapidMiner Marketplace provides a platform for developers to create data analysis algorithms and publish them to the community.
- RapidMiner Studio
- RapidMiner Auto Model
- RapidMiner Turbo Prep
- RapidMiner Server
- RapidMiner Radoop
In 2019, Gartner placed RapidMiner in the leader quadrant of its Magic Quadrant for Data Science & Machine Learning Platforms for the sixth year in a row. The report noted that RapidMiner provides deep and broad modeling capabilities for automated end-to-end model development. In the 2018 annual software poll, KDnuggets readers voted RapidMiner as one of the most popular data analytics software with the poll’s respondents citing the software package as the tool they use. RapidMiner has received millions of total downloads and has over 400,000 users including BMW, Intel, Cisco, GE, and Samsung as paying customers. RapidMiner claims to be the market leader in the software for data science platforms against competitors such as SAS and IBM.
About 50 developers worldwide participate in the development of the open source RapidMiner with the majority of the contributors being employees of RapidMiner. The company that develops RapidMiner received a $16 million Series C funding with participation from venture capital firms Nokia Growth Partners, Ascent Venture Partners, Longworth Venture Partners, Earlybird Venture Capital and OpenOcean. OpenOcean partner Michael "Monty" Widenius is a founder of MySQL.
- Markus Hofmann, Ralf Klinkenberg, “RapidMiner: Data Mining Use Cases and Business Analytics Applications (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series),” CRC Press, October 25, 2013.
- RapidMiner Embraces its Community and Open Source Culture Delivering Get-More-Open-Core Predictive Analytics, September 1, 2015.
- Guido Deutsch, “RapidMiner from Rapid-I at CeBIT 2010,” Data Mining Blog, March 18, 2010.
- “Interview with RapidMiner's Ingo Mierswa, Ralf Klinkenberg”, KDnuggets, February, 2010.
- “German Predictive Analytics Startup Rapid-I Rebrands As RapidMiner”, TechCrunch, November 4, 2013.
- David Norris, “RapidMiner - a potential game changer,” Bloor Research, November 13, 2013.
- Ajay Ohri, “Interview with Rapid-I Ingo Mierswa and Simon Fischer,” KDnuggets, August 2011.
- "Gartner Magic Quadrant for Data Science and Machine Learning Platforms". RapidMiner. Retrieved 2018-10-05.
- "Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018: Trends and Analysis". www.kdnuggets.com. Retrieved 2018-10-05.
- Ingrid Lunden, “German Predictive Analytics Startup Rapid-I Rebrands As RapidMiner, Takes $5M From Open Ocean, Earlybird To Tackle The U.S. Market,” TechCrunch, November 4, 2013.
- Evan Quinn, “Is Rapid-I the Hidden Giant of Analytics?,” QuinnSight Research, June 17, 2013.