We changed our name from IT Central Station: Here's why

RapidMiner OverviewUNIXBusinessApplication

RapidMiner is #3 ranked solution in top Predictive Analytics tools and #6 ranked solution in top Data Science Platforms. PeerSpot users give RapidMiner an average rating of 8 out of 10. RapidMiner is most commonly compared to KNIME: RapidMiner vs KNIME. RapidMiner is popular among the large enterprise segment, accounting for 61% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a comms service provider, accounting for 23% of all views.
What is RapidMiner?

RapidMiner's unified data science platform accelerates the building of complete analytical workflows - from data prep to machine learning to model validation to deployment - in a single environment, improving efficiency and shortening the time to value for data science projects.

RapidMiner Buyer's Guide

Download the RapidMiner Buyer's Guide including reviews and more. Updated: January 2022

RapidMiner Customers

PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen

RapidMiner Video

Archived RapidMiner Reviews (more than two years old)

Filter by:
Filter Reviews
Industry
Loading...
Filter Unavailable
Company Size
Loading...
Filter Unavailable
Job Level
Loading...
Filter Unavailable
Rating
Loading...
Filter Unavailable
Considered
Loading...
Filter Unavailable
Order by:
Loading...
  • Date
  • Highest Rating
  • Lowest Rating
  • Review Length
Search:
Showingreviews based on the current filters. Reset all filters
MayankNarula
Senior Manager, Digital Solutions at a tech services company with 1,001-5,000 employees
Real User
Dedicated data science for enterprise-level data collection and analysis but the UI should be enhanced
Pros and Cons
  • "Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
  • "The visual interface could use something like the-drag-and-drop features which other products already support. Some additional features can make RapidMiner a better tool and maybe more competitive."

What is our primary use case?

Our primary use for this particular product is to provide it for a large medical research company. It is a research company that helps pharmaceutical manufacturers, distributors, and clinics with analysis. Our client collects data that they crunch to provide these types of companies information about trends in distribution and production within the pharmaceutical industry. They use RapidMiner to crunch that data.  

The results of the analysis of the data helps with pricing, it helps with determining volumes, it helps with projections, et cetera. The client has a global license for RapidMiner, so here in the Middle Eastern branch, they use the same product as they do globally.  

What needs improvement?

I think it is a fairly straightforward interface generally. It is an easy-to-use solution compared to SAS Enterprise Miner, for example.  

On the other hand, compared to some other products, maybe the UI could be enhanced. The visual interface could have something like the-drag-and-drop features which Alteryx already supports. Some of those additional features can make RapidMiner a better tool and maybe more competitive or advanced.  

For how long have I used the solution?

We have been working with the product for maybe two years now.  

What do I think about the stability of the solution?

Stability is good with this solution.  

What do I think about the scalability of the solution?

Scalability is not really a concern with RapidMiner. The implementation that I have seen of this product for our client now is very sizeable. It scales very well.  

Which solution did I use previously and why did I switch?

I had previously used other solutions but not as a part of this company. It was a matter of who I was working for.  

How was the initial setup?

The initial setup is something that is supposed to be straightforward according to the people who implemented the solution. I was not part of the setup directly, so I don't know how complicated that is hands-on and so I'm not sure exactly what is involved or the exact amount of time it might take. The initial data mounting might be a big task when it is done globally as it is for this client.  

What's my experience with pricing, setup cost, and licensing?

The current licensing cost is advertised on the website. The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license.  

Which other solutions did I evaluate?

I used to work with a company where we sold a variety of products as system integrators. So I compared some product because we had to evaluate them to know the advantages and differences. At that time, we got to the stage where we were signing up with Alteryx as a partner.  

So I was in touch with those Alteryx guys and wanted to also compare their product with SAS Enterprise Miner, who we already partnered with, and with RapidMiner as well.  

A couple of customers were already using RapidMiner and we were in the process of partnering with Alteryx. We have to know the other products even if it is through secondary research in addition to what the vendors are presenting.  

What other advice do I have?

On a scale from one to ten where one is the worst and ten is the best, I would rate RapidMiner as around a seven. I choose seven because of the UI things and other parts of the product that might be improved. RapidMiner is more of an enterprise product. Here, in this region, most people like a packaged solution like Alteryx which covers more. Alteryx is also more attractive to many users because it is cheaper and easier to use from the perspective of the user interface.  

With Alteryx or Tableau, for example, you can just pick up data sources and then start EDL (enterprise data lake). It takes more effort to bring the data on to the data mart for RapidMiner and other enterprise products in the traffic mining category. These enterprise solutions have an additional level of complexity and flexibility but not everyone even needs it.  

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Professor of Health Services Research (now Emeritus) at a university with 1,001-5,000 employees
Real User
Offers good tutorials that make it easy to learn and use, with a powerful feature to compare machine learning algorithms
Pros and Cons
  • "The most valuable feature of RapidMiner is that it can read a large number of file formats including CSV, Excel, and in particular, SPSS."
  • "It would be helpful to have some tutorials on communicating with Python."

What is our primary use case?

Most of my work these days is to encourage younger members of staff in the university, as well as businesses outside, to get involved in data mining or machine learning. My requirement is that the tools have to be functionally good and also economically effective. So, products such as RapidMiner are very useful.

The primary use case at the moment is looking at psychiatric cases to see who is likely to commit suicide and who is not.

What is most valuable?

The most valuable feature of RapidMiner is that it can read a large number of file formats including CSV, Excel, and in particular, SPSS.

RapidMiner is easy for people to get started when they want to use it. They do not have to be very skilled and can easily put up a data flow, read data, remove the missing values, tidy up the data and then put it through a decision tree or regression analysis.

It is easy to add functionality using Python or R code.

Their tutorials, including the videos, are very good.

Another powerful feature is that it will try out a large variety of machine learning algorithms and then compare them. If you're a beginner, it's a good way of getting started because it will point you in the right direction. For example, it will point you towards decision trees or support vector machines, and it will tell you why.

What needs improvement?

When I started using RapidMiner, I found it difficult to get it to read the metadata. I wanted to use, for example, a pivot table, and it did not have the variable or the attribute names in it. There were no values. It took a long while to figure out how to do that, although it tends to do it automatically nowadays.

RapidMiner is not utterly intuitive for beginners. Sometimes people have trouble distinguishing between a file in their own file system and a repository entry, and they cannot find their data. This is an area where this solution could be improved.

It would be helpful to have some tutorials on communicating with Python. I found it a bit difficult at times to figure out which particular variable, or attribute, is going where in Python. It is probably a simple thing to do but I haven't mastered it yet. I'd like them to do a video on that. There are a large number of videos that are usually well-produced, but I don't think that they have one on that. Essentially, I would like to see how to communicate from RapidMiner to Python and from Python to RapidMiner.

One of the things I do a lot of is looking at questionnaires where people have used Likert-type scales. I don't recommend Likert-type scales, but if they're properly produced, which is a lot of hard work and it's not usually done, they're really powerful and you can do things like normalizing holes on the Likert scale. That's not the same as normalizing your data in RapidMiner. So, I would want to get results with these Likert scales, pass it through RapidMiner, do a normalization and pass back both the raw scores and the normalized scores and put in some rules, which will say if it's high on the raw score and on the normalized score and low on the standard deviation, then you can trust it.

For how long have I used the solution?

I have been using RapidMiner for three years.

What do I think about the stability of the solution?

I have not had any particular problems with stability but I've not really put it to the test with any really big datasets yet. 

What do I think about the scalability of the solution?

I have not yet been able to assess scalability. If my university acquires the server version then this is something that I will be able to test. As so far I have been using the desktop version, the question has not yet arisen.

How are customer service and technical support?

I have not contacted technical support. I've only used the user forum.

Which solution did I use previously and why did I switch?

I have also used KNIME and IBM SPSS Modeler (formerly Clementine).

I used IBM Modeler for a very big project that involved building a rule-based system for controlling sewage. We experienced the knowledge acquisition bottleneck during the interview process, as we interviewed experts and they could not tell us what the rules were. If we asked them what to do if the sewage became smelly and the water stopped flowing, they could not tell us a specific solution such as "If this chemical is a certain level and the temperature is above a certain value and the slope or the ground is more than a certain amount...", etc. Rather, they would give us anecdotal examples such as "Last week, we had this case where..." Using all of these examples, we were able to generate rules using machine learning.

We have also used VisiRule in several projects, which is an expert system building tool. We are thinking about using it again now, and I contacted them only a month ago. They can take a PMML file and turn it into an expert system. The question was whether they could take a dataset and create a PMML file, and with RapidMiner, I could.

The main competitor to RapidMiner is KNIME and it is much more cumbersome when you have to read SPSS files. You would first need to put in some R code in order to do that.

How was the initial setup?

My first deployment was trivial because I used to use the Iris data set, which took me six or seven minutes to set up. With my current larger dataset, it has taken four or five weeks.

What's my experience with pricing, setup cost, and licensing?

RapidMiner is adopting the same principle that SPSS has used, which makes it cheap or free at the university. When people graduate and go out to work in the industry, they are already familiar with the tool.

Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year.

Which other solutions did I evaluate?

I am also familiar with Weka and Alteryx, but I have not used them for real-world problems.

The trouble is that so many of these products now are almost identical in what they offer you. They learn from each other. So, they compete really on marginal advantages here and there such as cost, ease of use, and ease of learning.

I know businesses that have invested in RapidMiner, but I also know that many will go for KNIME because it is so cheap. It's free. It's more cumbersome to use, but after perhaps a week or ten days, you know it well enough.

What other advice do I have?

Using RapidMiner is a two-stage process. At first, it's something simple whereby you can get quick results. This is done by clicking to get the mean standard deviations or the numerical variables, for example. You can get a bar chart and a frequency count of all of the categorical variables. I would suggest that you get somebody to do that, just to get used to it, but then stop them and make sure that they do a course on machine learning. Otherwise, they may be missing something important like cleaning up the data.

For example, I did one project many years ago whereby I was asked by the Department of Education in the UK to look at the survey data on primary school children. It had been done by a market research company and they were a bit uneasy about the results. They didn't know what was wrong with the results, but they felt they weren't right, so they asked me to look at it. The first thing I did was to take a simple look at the values of all the variables and the first thing that became clear was that on the bar chart of variables, the right-hand end shot up. This was the value 99, which was clearly a missing value.

Now it was a missing value, but in SPSS, which is what they use to finalize it, they had not declared it to be a missing value, so it found a child whose age is apparently 99 and they treated that child as being age 99. I found that out very easily by working out the arithmetic mean age of these primary school children, who should be under the age of 10, and their average age was 34.4. That came up merely because they hadn't specified a missing value.

Now that's a very simple example, but it's the sort of thing that can go wrong when people just use a package and they don't know what the underlying assumptions are. Or people produce a linear regression when the relationship is nowhere linear. I recently refused to referee an article, for example, from China, because they did linear regression on data which clearly were not linear. They were exponential in format.

All this to say that this is the two-stage process. You can get started very quickly, but you must then make sure that your staff is properly trained not to make these kinds of mistakes.

The beginning learning curve is very shallow, but when you want to go on and do really advanced things then it takes more time. Companies know this, so they try to find cheap solutions such as employing sociology graduates to use the software. They don't understand the issues the same way a computer science or mathematics graduate would.

With respect to functionality, at the moment it has more features than I need and can handle. 

I would rate this solution as nine out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Learn what your peers think about RapidMiner. Get advice and tips from experienced pros sharing their opinions. Updated: January 2022.
564,143 professionals have used our research since 2012.
Pre-Sales BI at a wholesaler/distributor with 10,001+ employees
Real User
An excellent software platform that extracts information and data
Pros and Cons
  • "The most valuable feature is what the product sets out to do, which is extracting information and data."
  • "A great product but confusing in some way with regard to the user interface and integration with other tools."

What is our primary use case?

I work in business intelligence pre-sales and am a customer of RapidMiner. I'm using the product for data mining. I'm involved in data science and doing a Masters degree in the field so I'm aiming to explore the data I have gathered. I have some experience with the product but it's not that deep. 

What is most valuable?

The most valuable feature is what the product sets out to do, which is extracting information and data. 

What needs improvement?

I think it's a great product but confusing in some way with regard to the user interface and integration with other tools. An improvement would be the addition of some buttons which would be useful because I'm sometimes unsure why I need to use something or what is its purpose. 

I would say the same goes for additional features, the addition of buttons would be helpful. The product is better than other software that I use. 

For how long have I used the solution?

I've been using the product for two or three months although I'm not using the latest version. 

What do I think about the stability of the solution?

I believe the scalability is very good. 

What do I think about the scalability of the solution?

The solution is stable. I think the tools the product provides are all that I require for my specific needs. 

How are customer service and technical support?

Normally I use the community for support, and don't make contact with customer service. I have some friends who are working in that area and they help me. I also ask my coworkers for assistance. 

What other advice do I have?

The tools have a complete function for doing data. I'm not quite sure about the speed of RapidMiner but I think it's the fastest solution that I use. I don't think the product consumes a lot of RAM, which is good. 

There is something confusing in the product but it's possible that the error is mine and maybe I'm not yet familiar enough with the product.

I would therefore rate this product a nine out of 10. 

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Data Scientist at a tech company with 51-200 employees
Real User
Good data preparation, auto modeling, and forecasting features

What is our primary use case?

My primary use cases for RapidMiner are data cleaning, preparation, and forecasting. We are in the telecom industry.

What is most valuable?

The most valuable features are the Binary classification and Auto Model.

What needs improvement?

RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models.

For how long have I used the solution?

I have been using RapidMiner for about one year.

What do I think about the stability of the solution?

The stability of RapidMiner is good.

What do I think about the scalability of the solution?

This is a scalable solution.

How are customer service and technical support?

The technical support for RapidMiner is fantastic.

How was the

What is our primary use case?

My primary use cases for RapidMiner are data cleaning, preparation, and forecasting. We are in the telecom industry.

What is most valuable?

The most valuable features are the Binary classification and Auto Model.

What needs improvement?

RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models.

For how long have I used the solution?

I have been using RapidMiner for about one year.

What do I think about the stability of the solution?

The stability of RapidMiner is good.

What do I think about the scalability of the solution?

This is a scalable solution.

How are customer service and technical support?

The technical support for RapidMiner is fantastic.

How was the initial setup?

The initial setup is easy.

What other advice do I have?

I would rate this solution a nine out of ten.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Disclosure: I am a real user, and this review is based on my own experience and opinions.
it_user1025025
Project Engineer at a engineering company with 10,001+ employees
Real User
Good documentation and a clear structure between building processes and collecting results

What is our primary use case?

My primary use case for this solution is education. We did a course at the university and I used this for my own education, as opposed to a business use case.

What is most valuable?

The documentation for this solution is very good, where each operator is explained with how to use it. There is a very clear structure between building a process and then looking at the results.

What needs improvement?

The price of this solution should be improved.

For how long have I used the solution?

I have been using this solution for between six and twelve months.

What do I think about the stability of the solution?

I have had no problems with stability.

What do I think about the scalability of the solution?

I only used this solution locally, on my…

What is our primary use case?

My primary use case for this solution is education.

We did a course at the university and I used this for my own education, as opposed to a business use case.

What is most valuable?

The documentation for this solution is very good, where each operator is explained with how to use it.

There is a very clear structure between building a process and then looking at the results.

What needs improvement?

The price of this solution should be improved.

For how long have I used the solution?

I have been using this solution for between six and twelve months.

What do I think about the stability of the solution?

I have had no problems with stability.

What do I think about the scalability of the solution?

I only used this solution locally, on my laptop.

How are customer service and technical support?

I was only in touch with technical support for a licensing issue, and it worked out fine. I have no complaints.

How was the initial setup?

The initial setup of this solution is pretty straightforward and easy to understand.

What about the implementation team?

I installed this solution myself, on my laptop.

What's my experience with pricing, setup cost, and licensing?

I used an educational license for this solution, which is available free of charge.

What other advice do I have?

This is a solution that I recommend.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
DerekWilson
President and CEO at a tech services company with 1-10 employees
Real User
Being able to drag and drop controls enables me to focus on building the best model
Pros and Cons
  • "I like not having to write all solutions from code. Being able to drag and drop controls, enables me to focus on building the best model, without needing to search for syntax errors or extra libraries."
  • "The server product has been getting updated and continues to be better each release. When I started using RapidMiner, it was solid but not easy to set up and upgrade."

What is our primary use case?

Predicting which customers were most likely to churn for subscription-based contracts. Both RapidMiner Studio and RapidMiner Server were used to build, model, and deploy to production.

How has it helped my organization?

RapidMiner has moved companies toward being more data-driven. Decisions are made based on analysis of data and rapid prototyping, instead of gut feel.

What is most valuable?

I like not having to write all solutions from code. Being able to drag and drop controls, enables me to focus on building the best model, without needing to search for syntax errors or extra libraries.

What needs improvement?

The server product has been getting updated and continues to be better each release. When I started using RapidMiner, it was solid but not easy to set up and upgrade.

For how long have I used the solution?

Three to five years.

What other advice do I have?

Overall, I would rate RapidMiner at eight of 10. I have been able to use it effectively for all required use cases.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
it_user719814
Senior Community Manager at a tech vendor with 51-200 employees
Real User
Features Include Cutting-Edge Data Science Tools and Algorithms

What is our primary use case?

RapidMiner is primarily used to develop and deploy machine learning workflows to real business use cases, e.g. predictive analytics, segmentation, etc...

How has it helped my organization?

As a data science consultant, I was able to custom-build a machine learning lead generation (web scraping) tool for a client and have it be operational in 72 hours.

What is most valuable?

  1. Availability of cutting-edge data science tools and algorithms
  2. Ease of code-optional GUI
  3. Open Source Java core
  4. Easy integration with APIs, Python, R, cloud storage, cloud computing, etc.

What needs improvement?

I would like to see wider adoption of the RapidMiner platform by the Open Source community as a viable alternative/companion to Python and R.

For how long have I used the solution?

More than five years.

What do I think about the stability of the solution?

No.

What do I think about the scalability of the solution?

No.

How are customer service and technical support?

Superb.

Which solution did I use previously and why did I switch?

I used to use Excel. It was horrid.

How was the initial setup?

Not at all. Lots of YouTube videos and documentation. Super easy.

What's my experience with pricing, setup cost, and licensing?

RapidMiner has a freemium pricing model so, as long as your dataset has fewer than 10,000 rows, you have free software. There is also free academic licensing for Higher Ed or researchers. For companies, RapidMiner's paid license is much cheaper than SAS, SPSS, or other competitors. You definitely get your money's worth.

Which other solutions did I evaluate?

SAS and SPSS.

What other advice do I have?

Sign up and read what others do with RapidMiner on the user community (community.rapidminer.com).

Disclosure: I am a real user, and this review is based on my own experience and opinions.
Alfonso Gordillo
Director Research and Development at Projects Factory
Real User
Using the GUI, I can have models and algorithms drag and drop nodes

What is our primary use case?

I am testing RapidMiner as I am beginning data science. I will work with RapidMiner and Azure Machine Learning Studio. RapidMiner for Windows is an excellent graphical tool for data science.

How has it helped my organization?

I am using RapidMiner to generate insights for predictive maintenance in the Oil and Gas Industry, mainly for the electrical submersible pump system, including surface and down hole assets.

What is most valuable?

No code is necessary, no R and Python skills.  Using the GUI, I can have models and algorithms drag and drop nodes.

What needs improvement?

Improve the online data services.

For how long have I used the solution?

Trial/evaluations only.

What is our primary use case?

I am testing RapidMiner as I am beginning data science. I will work with RapidMiner and Azure Machine Learning Studio. RapidMiner for Windows is an excellent graphical tool for data science.

How has it helped my organization?

I am using RapidMiner to generate insights for predictive maintenance in the Oil and Gas Industry, mainly for the electrical submersible pump system, including surface and down hole assets.

What is most valuable?

  • No code is necessary, no R and Python skills. 
  • Using the GUI, I can have models and algorithms drag and drop nodes.

What needs improvement?

Improve the online data services.

For how long have I used the solution?

Trial/evaluations only.
Disclosure: I am a real user, and this review is based on my own experience and opinions.