Software Engineer - (Spark R&D)

Budapest, Hungary · Engineering

Description

RapidMiner Overview

RapidMiner builds tools for data scientists (“Real data science, fast and simple!”). We make data science teams more productive through a single platform that unifies data prep, machine learning, and model deployment.

More than 500,000 users in over 150 countries use RapidMiner products to acquire more customers, reduce financial risk, and increase operational performance. RapidMiner boasts a strong list of customers from across the globe in a broad spectrum of industries, including leading pharmaceutical, media, manufacturing, and retail companies. We have also been in the leaders quadrant of the "Gartner Magic Quadrant for Data Science Platforms" for the past 6 years straight.

We are venture capital (VC) backed with headquarters in Boston (MA, USA) and have locations in Dortmund (Germany), Budapest (Hungary), and London (UK).


Job Description

Title: Software Engineer
Department: Research & Development
Hiring Office: RapidMiner Kft., Aradi utca 8-10., 1062 Budapest, Hungary

Your role would include researching and participating in a new software development initiative that provides new solutions for distributed computing on large datasets, using technologies from Spark and its related ecosystem. We are looking for a passionate Software Engineer to join our Big Data team, which delivers the best data science experience to our new and existing enterprise customers.


Responsibilities

  • Pick up new concepts, tools, and technologies in the ever-changing ecosystem of Big Data, Data Warehouse, and Data Science technologies
  • Relentlessly improve our software and beat the bugs
  • Debug open-source libraries (H2O, Spark, HDFS, YARN, Hive) to find the best integration points
  • Support our customers and community by effectively communicating, analyzing, troubleshooting, and resolving a wide range of technical issues
  • Make your mark! Design, develop, and deliver your innovative ideas as part of the product roadmap
  • Flexibility to travel may be required from time to time

Requirements

This isn't a checklist - if you think you would be a good candidate, apply! But here are some things we are looking for:

  • Master's degree or equivalent business experience in Computer Science or a related area
  • At least 2 years of extensive Java/Scala development experience
  • Good verbal and written communication skills in English
  • Ability to analyze the computational and storage complexity of complex data science algorithms
  • Track record of participating in the development of distributed systems
  • Proficiency in Git, Jenkins, Dockerized environments, cloud computing, and Infrastructure as Code
  • Experience using any of the APIs provided by Spark, MLlib, H2O, Google BigQuery, S3, Azure Data Lake, TensorFlow, HDFS, Hadoop, YARN
  • Track record of conducting constructive code reviews
  • Experience in test automation and quality assurance
  • Familiarity with RapidMiner, data mining, and AI algorithms
  • Willingness to occasionally read and debug Scala, R, and Python codebases
  • Solid baseline knowledge of SQL and relational databases
  • Business understanding, with a focus on delivering results and meeting customer expectations
  • Desire and ability to learn any of the above if needed

Benefits

RapidMiner offers highly competitive salaries and unparalleled growth and development opportunities – all to create a compelling and rewarding work environment.

  • Flexible working hours and the possibility to occasionally work from home
  • Modern office equipment
  • Free beverages, fruit, and other goodies
  • Table tennis table to challenge your peers after lunch
  • Flat organisation structure, great team spirit, fun events
  • Easily accessible location (Oktogon, Budapest)


RapidMiner is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.

RapidMiner doesn't accept unsolicited agency resumes and won't pay fees to any third-party agency or firm that doesn't have a signed agreement with RapidMiner.
