Data Engineer (Scala, Spark)

FAQs

What programming languages are preferred for this role?

The preferred programming languages for this role are Python and Scala, particularly when combined with experience in Apache Spark.

What kind of projects will I work on as a Data Engineer?

You will build projects from scratch using Apache Spark for both batch and real-time architectures, and you will take part in architecture design and decision-making.

Is cloud experience required for this position?

Yes, experience or knowledge in cloud platforms such as Azure, AWS, or GCP is required.

Will I be required to communicate in English?

Yes, English is important for our projects because we often work with international clients, and daily communication is primarily in English.

How many years of experience should I have for this role?

You should have at least 2 years of experience processing large volumes of data with Spark and either Python or Scala.

What tools and technologies will I be working with?

You will work with tools such as Databricks, Data Factory, Synapse, and Apache Airflow to develop ETL processes and scalable pipelines.

Is teamwork important for this position?

Yes, being a team player and being willing to keep learning are important qualities for this role.

Does Capgemini offer training and development opportunities?

Yes, Capgemini offers a wide range of training opportunities, including access to platforms like Coursera, Udemy, and Capgemini University.

What are some benefits of working at Capgemini?

Benefits include a unique work environment, flexible holiday options, continuous training, wellbeing initiatives, and participation in volunteer and social action activities.

Is there a policy in place for diversity and inclusion?

Yes, Capgemini is committed to inclusion and equal opportunity, and it has an Equality Plan and a Code of Ethics in place to prevent discrimination based on personal or social circumstances.