Requirements
- 0-1 years of relevant working experience.
- Proficiency in programming languages commonly used to implement web scraping applications, such as Python and SQL.
- Basic experience with web scraping techniques, tools, and frameworks, for example Python libraries such as Scrapy, Selenium, HTTPX, and Requests.
- Basic experience in extracting structured and/or unstructured data from different types of websites and APIs, and in handling various data formats (e.g., HTML, JSON, CSV) using extraction tools like Beautiful Soup, dataclasses/pydantic, and regular expressions (regex).
- Basic experience in storing the extracted data in SQL databases and writing SQL queries for further transformation.
- Strong attention to detail to ensure the accuracy and quality of scraped data, and strong critical thinking to solve bugs and to implement monitoring, testing, cleaning, and validation techniques that identify and handle errors or missing information.
NucleusBI is an international B.I. and A.I. agency based in Haarlem, the Netherlands, and Canggu, Bali, Indonesia. We build end-to-end data solutions for international clients, from connectors and data warehousing to reporting, analysis, and A.I. applications.
As a junior data engineer in our team, you will be responsible for developing and monitoring web scrapers and extracting data to meet our clients' requirements.
Preferred skills & qualifications:
- Version Control and Collaboration: Familiarity with version control systems like Git or other similar tools.
- Task Scheduling: Experience with task schedulers such as cron jobs or workflow management systems like Airflow. Ability to schedule and automate scraping tasks based on specific time intervals or dependencies.
- Familiarity with containerization tools like Docker.
- Bachelor’s degree in Computer Science, IT or similar.
- Fluency in English.
Responsibilities
- Developing web scrapers in collaboration with the data engineering team.
- Automating monitoring and validation procedures.
- Extracting and transforming raw output from the scrapers to fit users' requirements.
Benefits:
- You will be working in a fast-paced international environment where you will be encouraged to learn and improve your skill set.
- A competitive salary based on skills and prior experience.
- If you come from another part of Indonesia, we offer a one-off reimbursement for flights and other relocation expenses.
Recruitment Process:
We will be looking to hire a Junior Data Engineer as soon as possible.
The recruitment process will be:
- A first interview
- A technical test
- A second interview