RapidMiner Overview

RapidMiner is the #3 ranked solution among top Predictive Analytics tools and the #6 ranked solution among top Data Science Platforms. IT Central Station users give RapidMiner an average rating of 8 out of 10. RapidMiner is most commonly compared to KNIME. RapidMiner is popular among the large enterprise segment, which accounts for 70% of users researching this solution on IT Central Station. The top industry researching this solution is comms service providers, accounting for 22% of all views.
What is RapidMiner?

RapidMiner's unified data science platform accelerates the building of complete analytical workflows - from data prep to machine learning to model validation to deployment - in a single environment, improving efficiency and shortening the time to value for data science projects.

RapidMiner Buyer's Guide

Download the RapidMiner Buyer's Guide including reviews and more. Updated: November 2021

RapidMiner Customers

PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen

Pricing Advice

What users are saying about RapidMiner pricing:
  • "Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
  • "The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."

RapidMiner Reviews

LM
Professor of Health Services Research (now Emeritus) at a university with 1,001-5,000 employees
Real User
Top 5
Offers good tutorials that make it easy to learn and use, with a powerful feature to compare machine learning algorithms

Pros and Cons

  • "The most valuable feature of RapidMiner is that it can read a large number of file formats including CSV, Excel, and in particular, SPSS."
  • "It would be helpful to have some tutorials on communicating with Python."

What is our primary use case?

Most of my work these days is to encourage younger members of staff in the university, as well as businesses outside, to get involved in data mining or machine learning. My requirement is that the tools have to be functionally good and also economically effective. So, products such as RapidMiner are very useful.

The primary use case at the moment is looking at psychiatric cases to see who is likely to commit suicide and who is not.

What is most valuable?

The most valuable feature of RapidMiner is that it can read a large number of file formats including CSV, Excel, and in particular, SPSS.

RapidMiner is easy for people to get started when they want to use it. They do not have to be very skilled and can easily put up a data flow, read data, remove the missing values, tidy up the data and then put it through a decision tree or regression analysis.

It is easy to add functionality using Python or R code.

Their tutorials, including the videos, are very good.

Another powerful feature is that it will try out a large variety of machine learning algorithms and then compare them. If you're a beginner, it's a good way of getting started because it will point you in the right direction. For example, it will point you towards decision trees or support vector machines, and it will tell you why.

What needs improvement?

When I started using RapidMiner, I found it difficult to get it to read the metadata. I wanted to use, for example, a pivot table, and it did not have the variable or the attribute names in it. There were no values. It took a long while to figure out how to do that, although it tends to do it automatically nowadays.

RapidMiner is not utterly intuitive for beginners. Sometimes people have trouble distinguishing between a file in their own file system and a repository entry, and they cannot find their data. This is an area where this solution could be improved.

It would be helpful to have some tutorials on communicating with Python. I found it a bit difficult at times to figure out which particular variable, or attribute, is going where in Python. It is probably a simple thing to do but I haven't mastered it yet. I'd like them to do a video on that. There are a large number of videos that are usually well-produced, but I don't think that they have one on that. Essentially, I would like to see how to communicate from RapidMiner to Python and from Python to RapidMiner.
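As a rough illustration of how data moves between RapidMiner and Python, the sketch below follows the convention used by RapidMiner's Execute Python operator, where the script defines a function named rm_main, each connected input port arrives as a pandas DataFrame argument, and the returned DataFrame goes to the output port. The attribute name "score" and the z-score step are hypothetical placeholders, not something taken from the review.

```python
import pandas as pd

def rm_main(data):
    """Entry point called by the operator; `data` is the example set on the first input port."""
    # Each RapidMiner attribute arrives as a DataFrame column with the same name.
    result = data.copy()
    # Hypothetical attribute "score": keep the raw column and add a normalized one beside it.
    mean = result["score"].mean()
    std = result["score"].std(ddof=0)
    result["score_normalized"] = (result["score"] - mean) / std
    # The returned DataFrame is delivered to the operator's first output port.
    return result

# Outside RapidMiner, the same function can be exercised with a plain DataFrame.
demo = rm_main(pd.DataFrame({"score": [1, 3, 5]}))
```

Because the function only sees DataFrames, it can be developed and checked in an ordinary Python session before being pasted into the operator.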

One of the things I do a lot of is looking at questionnaires where people have used Likert-type scales. I don't recommend Likert-type scales, but if they're properly produced, which is a lot of hard work and it's not usually done, they're really powerful and you can do things like normalizing holes on the Likert scale. That's not the same as normalizing your data in RapidMiner. So, I would want to get results with these Likert scales, pass it through RapidMiner, do a normalization and pass back both the raw scores and the normalized scores and put in some rules, which will say if it's high on the raw score and on the normalized score and low on the standard deviation, then you can trust it.

For how long have I used the solution?

I have been using RapidMiner for three years.

What do I think about the stability of the solution?

I have not had any particular problems with stability but I've not really put it to the test with any really big datasets yet. 

What do I think about the scalability of the solution?

I have not yet been able to assess scalability. If my university acquires the server version then this is something that I will be able to test. As so far I have been using the desktop version, the question has not yet arisen.

How are customer service and technical support?

I have not contacted technical support. I've only used the user forum.

Which solution did I use previously and why did I switch?

I have also used KNIME and IBM SPSS Modeler (formerly Clementine).

I used IBM Modeler for a very big project that involved building a rule-based system for controlling sewage. We experienced the knowledge acquisition bottleneck during the interview process, as we interviewed experts and they could not tell us what the rules were. If we asked them what to do if the sewage became smelly and the water stopped flowing, they could not tell us a specific solution such as "If this chemical is a certain level and the temperature is above a certain value and the slope or the ground is more than a certain amount...", etc. Rather, they would give us anecdotal examples such as "Last week, we had this case where..." Using all of these examples, we were able to generate rules using machine learning.

We have also used VisiRule in several projects, which is an expert system building tool. We are thinking about using it again now, and I contacted them only a month ago. They can take a PMML file and turn it into an expert system. The question was whether they could take a dataset and create a PMML file, and with RapidMiner, I could.

The main competitor to RapidMiner is KNIME and it is much more cumbersome when you have to read SPSS files. You would first need to put in some R code in order to do that.

How was the initial setup?

My first deployment was trivial because I used to use the Iris data set, which took me six or seven minutes to set up. With my current larger dataset, it has taken four or five weeks.

What's my experience with pricing, setup cost, and licensing?

RapidMiner is adopting the same principle that SPSS has used, which makes it cheap or free at the university. When people graduate and go out to work in the industry, they are already familiar with the tool.

Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year.

Which other solutions did I evaluate?

I am also familiar with Weka and Alteryx, but I have not used them for real-world problems.

The trouble is that so many of these products now are almost identical in what they offer you. They learn from each other. So, they compete really on marginal advantages here and there such as cost, ease of use, and ease of learning.

I know businesses that have invested in RapidMiner, but I also know that many will go for KNIME because it is so cheap. It's free. It's more cumbersome to use, but after perhaps a week or ten days, you know it well enough.

What other advice do I have?

Using RapidMiner is a two-stage process. At first, it's something simple whereby you can get quick results. This is done by clicking to get the means and standard deviations of the numerical variables, for example. You can get a bar chart and a frequency count of all of the categorical variables. I would suggest that you get somebody to do that, just to get used to it, but then stop them and make sure that they do a course on machine learning. Otherwise, they may be missing something important like cleaning up the data.

For example, I did one project many years ago whereby I was asked by the Department of Education in the UK to look at the survey data on primary school children. It had been done by a market research company and they were a bit uneasy about the results. They didn't know what was wrong with the results, but they felt they weren't right, so they asked me to look at it. The first thing I did was to take a simple look at the values of all the variables and the first thing that became clear was that on the bar chart of variables, the right-hand end shot up. This was the value 99, which was clearly a missing value.

Now it was a missing value, but in SPSS, which is what they use to finalize it, they had not declared it to be a missing value, so it found a child whose age is apparently 99 and they treated that child as being age 99. I found that out very easily by working out the arithmetic mean age of these primary school children, who should be under the age of 10, and their average age was 34.4. That came up merely because they hadn't specified a missing value.
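The effect described above can be reproduced with a few lines of Python. This is a minimal sketch using made-up ages, not the survey data from the review; the only detail taken from the text is that 99 was the undeclared missing-value code.

```python
from statistics import mean

MISSING_CODE = 99  # sentinel used for "no answer", as in the review

def mean_age(ages, missing_code=None):
    """Arithmetic mean of ages, skipping a missing-value code when one is declared."""
    valid = [a for a in ages if missing_code is None or a != missing_code]
    return mean(valid)

# Hypothetical sample: six primary school children and four unanswered records.
ages = [5, 6, 7, 8, 9, 7, 99, 99, 99, 99]

naive = mean_age(ages)                   # 99 is treated as a real age
declared = mean_age(ages, MISSING_CODE)  # 99 is excluded as missing
```

With these invented values the naive mean lands far above any plausible primary school age, while the mean with the code declared stays inside the expected range, which is the same check the reviewer used to spot the problem.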

Now that's a very simple example, but it's the sort of thing that can go wrong when people just use a package and they don't know what the underlying assumptions are. Or people produce a linear regression when the relationship is nowhere near linear. I recently refused to referee an article, for example, from China, because they did linear regression on data which clearly were not linear. They were exponential in form.
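To make the point about linearity concrete, the following sketch fits a straight line to synthetic exponential data twice: once on the raw values and once after taking logarithms. The data, the helper name fit_line, and the constants 2.0 and 0.5 are all assumptions chosen for illustration.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b, r_squared)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return a, b, 1 - ss_res / ss_tot

# Hypothetical data that grows exponentially: y = 2 * exp(0.5 * x).
xs = list(range(10))
ys = [2.0 * math.exp(0.5 * x) for x in xs]

# A straight line through the raw values fits poorly.
_, _, r2_raw = fit_line(xs, ys)

# After a log transform the relationship is linear: log(y) = log(2) + 0.5 * x.
a_log, b_log, r2_log = fit_line(xs, [math.log(y) for y in ys])
```

The raw fit leaves a visibly lower R-squared than the log-transformed fit, whose slope and intercept recover the growth rate and the logarithm of the starting value. Plotting residuals would show the same thing without any arithmetic.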

All this to say that this is the two-stage process. You can get started very quickly, but you must then make sure that your staff is properly trained not to make these kinds of mistakes.

The beginning learning curve is very shallow, but when you want to go on and do really advanced things then it takes more time. Companies know this, so they try to find cheap solutions such as employing sociology graduates to use the software. They don't understand the issues the same way a computer science or mathematics graduate would.

With respect to functionality, at the moment it has more features than I need and can handle. 

I would rate this solution as nine out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
RajivSharma
Senior Product Manager at CustomerXps Software
Real User
Top 5 Leaderboard
Extensive features, Turbo Prep, Auto ML, good GUI and good stability

Pros and Cons

  • "The GUI capabilities of the solution are excellent. Their Auto ML model provides for even non-coder data scientists to deploy a model."
  • "The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team. If you made decisions within this machine learning model, you need to explain why you did so. It would be better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator."

What is our primary use case?

We primarily use the solution for training and deploying various supervised and unsupervised models in the area of financial crime management.

How has it helped my organization?

It enables banks to quickly try and experiment with multiple algorithms on the same data set without having to worry about keeping full-time data scientists on staff. The focus shifts to data procurement, feature engineering, and model validation rather than coding the same thing in other languages.

What is most valuable?

The GUI capabilities of the solution are excellent. Their Auto ML model provides for even non-coder data scientists to deploy a model.

The features the solution offers are quite extensive. We haven't had a chance to utilize all of them yet. The solution is constantly evolving to continue to be cutting edge.

What needs improvement?

The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated BFSI environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team.

If you made decisions within this machine learning model, you need to explain why you did so. It would be better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator.

For how long have I used the solution?

I've been using the solution for about 3 years.

What do I think about the stability of the solution?

The stability of the solution is good. We haven't faced any bugs or glitches. However, it does depend on the data model you are working on. So far, for example, we haven't modeled a big data site, so I'm not sure how stability would be affected in a case like that.

What do I think about the scalability of the solution?

In terms of scaling, it depends on the model you are deploying. When you build a model in a batch, scale is not an issue, because you can run it for hours or days to train the model. Currently, our company works within banks where we process around 10 to 20 million transactions per day on a single site in the bank.

You need to balance everything. In a real-time system, like the way we are operating, where we have a high case of having to send a response in less than one second, we need to have a balance. There are various ways we make sure, according to the deployment model, that we can respond within that one-second timeframe.

Typically, the kinds of people using the solution are data scientists and data analysts.

How are customer service and technical support?

In terms of technical support, whenever we face issues, the first places we go are online forums or the solution's blogs. Typically, we can find an answer to our issues there. If there are issues that need to be fixed, they do offer extensions where you can write your own Python or R program to address them.

Which solution did I use previously and why did I switch?

Before using this solution, we were mostly working on a native R and Python-based platform. It was more of an open-source tool. This is the first commercial tool we have used.

How was the initial setup?

In terms of the initial setup, you don't need to install the solution because it's a desktop version. It's the server deployment model that is a bit more complex. However, the desktop version is a standalone application and it's pretty straightforward.

Unless you have some preliminary understanding of how machine learning models work, you will not be able to use the tool. That's not just the case with RapidMiner; it applies to any tool. You have to check the parameters for every algorithm and you need to understand how algorithms work. Even with the excellent GUI and auto model capabilities, you'll still need to have a decent level of data science or machine learning knowledge.

What about the implementation team?

We have an in-house team of data engineers, data analysts, subject matter experts, data scientists and ML engineers who collaborate with the bank's IT and business teams to deliver the solution. This is handled by a dedicated team working under the Professional consulting group.

What was our ROI?

This is confidential as banks do not usually share this information. However, given the ML platform with auto model capability, I can say ROI would easily exceed at least 90%. This again depends on how many models are trained and deployed on a regular basis.

What's my experience with pricing, setup cost, and licensing?

Within the company, we have about seven user licenses. When it comes to clients, they typically only have one license, which is more than enough for their use.

Which other solutions did I evaluate?

We did evaluate a few others like IBM SPSS. But RapidMiner is very user friendly and has a robust rating on various leading portals like KDnuggets.

What other advice do I have?

We're in the banking and finance space, so mostly our clients use the on-premises deployment model. As part of compliance, it's required that data should not go out of the bank's boundaries or firewall.

This solution is a great tool for users that are experimenting and is an alternative to doing the coding and everything themselves. It's perfect for those who want to focus more on data analysis rather than spending days coding everything. Users can go pretty far because of the solution's Auto ML capability which cuts down on coding. It allows for great productivity.

I'd rate the solution eight out of ten.

Which deployment model are you using for this solution?

On-premises

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
MayankNarula
Senior Manager, Digital Solutions at a tech services company with 1,001-5,000 employees
Real User
Top 5
Dedicated data science for enterprise-level data collection and analysis but the UI should be enhanced

Pros and Cons

  • "Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
  • "The visual interface could use something like the drag-and-drop features which other products already support. Some additional features can make RapidMiner a better tool and maybe more competitive."

What is our primary use case?

Our primary use for this particular product is to provide it for a large medical research company. It is a research company that helps pharmaceutical manufacturers, distributors, and clinics with analysis. Our client collects data that they crunch to provide these types of companies information about trends in distribution and production within the pharmaceutical industry. They use RapidMiner to crunch that data.  

The results of the analysis of the data help with pricing, with determining volumes, with projections, et cetera. The client has a global license for RapidMiner, so here in the Middle Eastern branch, they use the same product as they do globally.

What needs improvement?

I think it is a fairly straightforward interface generally. It is an easy-to-use solution compared to SAS Enterprise Miner, for example.  

On the other hand, compared to some other products, maybe the UI could be enhanced. The visual interface could have something like the drag-and-drop features which Alteryx already supports. Some of those additional features can make RapidMiner a better tool and maybe more competitive or advanced.

For how long have I used the solution?

We have been working with the product for maybe two years now.  

What do I think about the stability of the solution?

Stability is good with this solution.  

What do I think about the scalability of the solution?

Scalability is not really a concern with RapidMiner. The implementation that I have seen of this product for our client now is very sizeable. It scales very well.  

Which solution did I use previously and why did I switch?

I had previously used other solutions but not as a part of this company. It was a matter of who I was working for.  

How was the initial setup?

The initial setup is something that is supposed to be straightforward according to the people who implemented the solution. I was not part of the setup directly, so I don't know how complicated that is hands-on and so I'm not sure exactly what is involved or the exact amount of time it might take. The initial data mounting might be a big task when it is done globally as it is for this client.  

What's my experience with pricing, setup cost, and licensing?

The current licensing cost is advertised on the website. The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license.  

Which other solutions did I evaluate?

I used to work with a company where we sold a variety of products as system integrators. So I compared some products because we had to evaluate them to know the advantages and differences. At that time, we got to the stage where we were signing up with Alteryx as a partner.

So I was in touch with those Alteryx guys and wanted to also compare their product with SAS Enterprise Miner, who we already partnered with, and with RapidMiner as well.  

A couple of customers were already using RapidMiner and we were in the process of partnering with Alteryx. We have to know the other products even if it is through secondary research in addition to what the vendors are presenting.  

What other advice do I have?

On a scale from one to ten where one is the worst and ten is the best, I would rate RapidMiner as around a seven. I choose seven because of the UI things and other parts of the product that might be improved. RapidMiner is more of an enterprise product. Here, in this region, most people like a packaged solution like Alteryx which covers more. Alteryx is also more attractive to many users because it is cheaper and easier to use from the perspective of the user interface.  

With Alteryx or Tableau, for example, you can just pick up data sources and then start EDL (enterprise data lake). It takes more effort to bring the data on to the data mart for RapidMiner and other enterprise products in the traffic mining category. These enterprise solutions have an additional level of complexity and flexibility but not everyone even needs it.  

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
AB
DATA STRATEGY at a tech services company with 51-200 employees
Real User
Top 10
Fast prototyping, good for data science, and has a good onboarding experience

Pros and Cons

  • "The data science, collaboration, and IDN are very, very strong."
  • "In the Mexican or Latin American market, it's kind of pricey."

What is our primary use case?

The solution is used for market-based analysis. That said, things like scoring models, predictive models, and forecasting have not yet been implemented. However, they have a lot of traction on Industry 4.0. We have not gotten into that just yet.

What is most valuable?

Since the beginning, we had a really nice onboarding experience with RapidMiner.

The solution is fast when it comes to prototyping. The prep and auto model feature is something that people really like. They help you prototype a use case very quickly. The quick prototyping features that are included in the software get everything ready - including the model instructions and all that.

I do see RapidMiner as much more of a data science platform, and not really an art restoration platform. The data science, collaboration, and IDN are very, very strong.

What needs improvement?

In the Mexican or Latin American market, it's kind of pricey. 

The pricing can be a bit high.

Some of the data science platforms offer much more flexibility. Of course, there's not the same software for visual license results. It's somehow rigid.

I'd like to have a module for analytics there. For example, the capability of keeping track of changes in every version would be helpful. It was very, very difficult to track. 

Even as a partner, it is difficult to keep up with whatever changes they have in mind. On the commercial side, it has been the same. Since I started, every three months, they propose a different commercial scheme. It's one of the reasons that they got lower marks on the Gartner report.

The UI is not super intuitive. It might be nice if, the first time a person uses the product, there was a wizard that could walk them through everything. It's supposedly very intuitive, and yet, honestly, I don't know where to click. They need to offer a better-guided experience for beginners.

For how long have I used the solution?

While the company likely has five or so years of experience with the solution, I only have three. However, I have had a lot of time to work with it. I've only been at the company for three years, which is why there is a time difference.

What do I think about the scalability of the solution?

The scalability is important. For example, it's very easy to quote the desktop software. They tell, for example, how much data science users are going to be working with on the machines, in terms of creating models. You need to know how many concurrent users there are going to be. For that, you will have to quote the RapidMiner server. It can get very complex quoting once you get into operationalizing your models. 

They don't fully help you figure things out in terms of big accounts and scenarios. We'd like to have someone really technical assist. That type of person would know what questions to ask.

How was the initial setup?

In terms of implementation, I will say that if a user gets the desktop version without the server, it's very difficult for them to actually get value out of the product. RapidMiner has discovered that the churn on the desktop product is very high. If you're going to get the desktop version, no worries; however, you have to have a clear strategy on how the outputs of your models are going to make an impact on the business, because the impact is not always clearly seen.

What's my experience with pricing, setup cost, and licensing?

The solution is considered expensive, at least in the Latin American market. They do try their best to give discounts whenever it's possible. However, the overall price is something to be cautious about.

What other advice do I have?

We are using the latest version of the solution right now.

In general, we've been happy with the solution. I'd rate it at a nine out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
AL
Professor at a university with 51-200 employees
Real User
Top 5 Leaderboard
Efficient deep learning that is easy to use and easy to setup

Pros and Cons

  • "The best part of RapidMiner is efficiency."
  • "I think that they should make deep learning models easier."

What is our primary use case?

The primary use case of RapidMiner is for teaching. I select data and give it to the students to clean. They use several algorithms, they compact them, they evaluate them, then give the results to me.

How has it helped my organization?

RapidMiner is used in my organization only for education.

What is most valuable?

The feature that I like the most is the business of utilization. You just have to connect with an operator to launch an analysis state to read the data. Then, you filter with another operator and clean it with another one. You can also use machine learning algorithms and then it gives you the results.

Compared to when you are programming with Python, where you have to write all of the instructions, this is better because the effort of programming takes more time.

The best part of RapidMiner is efficiency.

What needs improvement?

I have the deep learning models on my laptop but it doesn't work very well. I think that they should make deep learning models easier. They are using deep learning models today for image processing and language processing.

For how long have I used the solution?

I am a professor of Data Science, among other things. I have used RapidMiner for the last six years when I give a machine learning course. I started with the first version of RapidMiner and now use the latest version.

What do I think about the stability of the solution?

Very good 

What do I think about the scalability of the solution?

Not used yet

How are customer service and technical support?

I have never contacted technical support. Any time that I have had an issue, I have tried to solve it myself. 

One time when I had an issue with deep learning, 

Which solution did I use previously and why did I switch?

I use several solutions.

How was the initial setup?

The initial setup is very easy. The software is very easy to use.

I have been teaching for nine years and because I have the experience, it is easy.

What other advice do I have?

I have worked with RapidMiner, but I have not yet explored all of the functionality of the software. As an example, the relation with big data and the relation with the Cloud. I have used the utilities quite a bit.

In the last model, they added automation cleaning for data preparation. It is very interesting.

I am a computer scientist and I received my Ph.D. 23 years ago. I am a researcher, and when I have a problem, I use it to research and to find a solution to much more difficult problems.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
AL
Professor at a university with 51-200 employees
Real User
Top 5 Leaderboard
Efficient, easy to use, and is a good teaching tool for data science and machine learning

Pros and Cons

  • "RapidMiner is very easy to use."
  • "I would like to see all users have access to all of the deep learning models, and that they can be used easily."

What is our primary use case?

I used RapidMiner to help teach Data Science and Statistics. The students understand the pipeline of a data science project better than they do when developing in a language like Python. Furthermore, users can easily integrate Python if they like.

How has it helped my organization?

This solution assists me with instructing students on how to pre-process data and use various machine learning algorithms.

What is most valuable?

The best thing about RapidMiner is efficiency.

RapidMiner is very easy to use. You just have to connect operators to launch an analysis state. You read the data with one operator, then filter it with another, and clean it with another. Then, you use an operator for the machine learning algorithms and the results are generated. In contrast, when you are programming in Python you have to write all of the instructions and it takes longer to do.
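For comparison, here is a rough sketch of what writing out the same read, filter, clean, and learn steps by hand in Python might look like. Everything in it is an assumption made for illustration: the inline CSV, the column names, and the one-split decision stump that stands in for a real learning operator.

```python
import csv
import io

# Hypothetical CSV standing in for the file a read operator would load.
RAW = """age,income,bought
25,30000,no
41,,yes
38,52000,yes
52,61000,yes
19,18000,no
33,27000,no
"""

def read_rows(text):
    """Step 1: read the data into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def drop_missing(rows):
    """Step 2: filter out examples that have any empty value."""
    return [r for r in rows if all(v != "" for v in r.values())]

def to_numeric(rows):
    """Step 3: clean the data by converting numeric columns from text."""
    return [
        {"age": int(r["age"]), "income": float(r["income"]), "bought": r["bought"]}
        for r in rows
    ]

def fit_stump(rows, feature, label):
    """Step 4: learn one threshold on `feature` that best separates the label."""
    best = None
    values = sorted({r[feature] for r in rows})
    for low, high in zip(values, values[1:]):
        threshold = (low + high) / 2
        correct = sum((r[feature] > threshold) == (r[label] == "yes") for r in rows)
        if best is None or correct > best[1]:
            best = (threshold, correct)
    return best

rows = to_numeric(drop_missing(read_rows(RAW)))
threshold, correct = fit_stump(rows, "income", "bought")
```

Each function corresponds loosely to one operator in a visual workflow, which gives a sense of how much has to be written out explicitly when the steps are coded rather than connected.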

In the most recent version, they added automation of the cleaning of data, which is a very interesting feature.

What needs improvement?

I would like to see all users have access to all of the deep learning models, and that they can be used easily.

RapidMiner loads very slowly, which is something that should be improved.

For how long have I used the solution?

I have been using this solution for at least ten years. I have been using it since the first version was released.

What do I think about the stability of the solution?

This is a stable solution, although it sometimes takes a long time to load.

How are customer service and technical support?

Whenever I have a problem, I try to solve it myself.

One time when I had a problem with deep learning, I tried to get answers through the forums. I tried to chat through the application but I didn't get an answer.

Which solution did I use previously and why did I switch?

I have used many different tools and have experience with a lot of different software, and I can say that RapidMiner is very easy to use.

How was the initial setup?

The initial setup of RapidMiner is very easy.

What about the implementation team?

I installed the software myself.

What's my experience with pricing, setup cost, and licensing?

I have been using the educational version.

What other advice do I have?

I have not worked with all of the features in RapidMiner. For example, I have not worked with all of the features for Big Data, and I have not used it with the cloud.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PS
Pre-Sales BI at a wholesaler/distributor with 10,001+ employees
Real User
Top 10
An excellent software platform that extracts information and data

Pros and Cons

  • "The most valuable feature is what the product sets out to do, which is extracting information and data."
  • "A great product but confusing in some way with regard to the user interface and integration with other tools."

What is our primary use case?

I work in business intelligence pre-sales and am a customer of RapidMiner. I'm using the product for data mining. I'm involved in data science and doing a Master's degree in the field, so I'm aiming to explore the data I have gathered. I have some experience with the product, but it's not that deep.

What is most valuable?

The most valuable feature is what the product sets out to do, which is extracting information and data. 

What needs improvement?

I think it's a great product, but it is confusing in some ways with regard to the user interface and integration with other tools. Adding some buttons would be a useful improvement, because I'm sometimes unsure why I need to use something or what its purpose is.

I would say the same goes for additional features, the addition of buttons would be helpful. The product is better than other software that I use. 

For how long have I used the solution?

I've been using the product for two or three months although I'm not using the latest version. 

What do I think about the stability of the solution?

The solution is stable. I think the tools the product provides are all that I require for my specific needs.

What do I think about the scalability of the solution?

I believe the scalability is very good.

How are customer service and technical support?

Normally I use the community for support, and don't make contact with customer service. I have some friends who are working in that area and they help me. I also ask my coworkers for assistance. 

What other advice do I have?

The tools cover everything needed for working with data. I'm not quite sure about the speed of RapidMiner, but I think it's the fastest solution that I use. I don't think the product consumes a lot of RAM, which is good.

There is something confusing in the product but it's possible that the error is mine and maybe I'm not yet familiar enough with the product.

I would therefore rate this product a nine out of 10. 

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Andrey Alfiansyah
Executive Director at a philanthropy with 201-500 employees
Real User
Top 20
Helps you make informed decisions using data

What is our primary use case?

We use RapidMiner to do data modeling and forecasting for things like bank loans. It helps us determine which bank loans have potential and which loans can be issued. So, for example, we use the data from our existing customers to make predictions about new borrowers. That way, we can estimate how well the loan will perform.
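As a rough illustration of the idea of using existing customers to estimate how a new loan might perform, the sketch below computes historical repayment rates per customer group and applies them to new borrowers. It is not the reviewer's actual model or anything produced by RapidMiner: the data, the `income_band` grouping, the function names, and the fallback value of 0.5 for unseen groups are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical history of existing customers: (income_band, repaid_in_full).
existing_customers = [
    ("low", False), ("low", True), ("low", False),
    ("mid", True), ("mid", True), ("mid", False),
    ("high", True), ("high", True),
]

def repayment_rates(history):
    """Return the observed share of repaid loans for each income band."""
    totals = defaultdict(int)
    repaid = defaultdict(int)
    for band, was_repaid in history:
        totals[band] += 1
        repaid[band] += int(was_repaid)
    return {band: repaid[band] / totals[band] for band in totals}

def estimate(rates, band, default=0.5):
    """Estimate repayment probability for a new borrower in the given band.

    Bands never seen in the history fall back to an assumed default.
    """
    return rates.get(band, default)

rates = repayment_rates(existing_customers)
for band in ["high", "low", "unknown"]:
    print(band, round(estimate(rates, band), 2))
```

A real forecasting model would use many more attributes and a proper learning algorithm, but the flow is the same: learn from labeled past loans, then score new applicants.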

What needs improvement?

Many things in the interface look nice, but they aren't of much use to the operator. It already has lots of variables in there. 

For how long have I used the solution?

I've been using RapidMiner for around two years.

What do I think about the stability of the solution?

Right now, we're trying to see how stable the profiling and modeling features are, so we are still experimenting through trial and error.

What do I think about the scalability of the solution?

I can't say much about scaling up RapidMiner because we're using it for only one department.

How are customer service and support?

RapidMiner technical support offers so much information. I feel that they've been helpful. 

How was the initial setup?

Installing RapidMiner is relatively straightforward. I am only on a license for analyzing the data by itself. After installation, you have to optimize the module applications using the standard process and methodology, so the total time for deployment is around two or three weeks.

What's my experience with pricing, setup cost, and licensing?

We pay the RapidMiner license monthly.

What other advice do I have?

I rate RapidMiner 10 out of 10. It's helpful if you want to make informed decisions using data. We can take the information, tease out the attributes, and label everything. It's suitable for profiling and forecasting in any industry.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.