YoungCapital

CASE STUDY

YOUNGCAPITAL

Youngcapital and Relevant Online utilise the power of AI.
Driving recruitment efficiency: predictive matchmaking model.
 

Challenges in online recruitment

YoungCapital faces a major challenge in efficiently matching job seekers with employers through online advertising. Managing marketing budgets across different channels and campaigns is difficult and labor-intensive, with unpredictable demand for vacancies adding complexity.

Existing tools like Google’s “Performance Max” or other programmatic solutions focus only on budget optimization within their own platforms, lacking the ability to match candidates to employers’ needs effectively. To address this, YoungCapital and Relevant Online developed the Predictive Automated Matchmaking Model, aimed at finding the most cost-effective channels to match candidates with employers, minimizing the “cost per hire.”

Utilising machine learning to optimise marketing costs matching efficiency candidate satisfaction marketing costs matching efficiency candidate satisfaction marketing costs matching efficiency candidate satisfaction marketing costs matching efficiency candidate satisfaction in recruitment

Approach

“By combining YoungCapital’s knowledge of recruitment with Relevant Online’s AI expertise, the coöperation revolutionises the procurement of candidates, resulting in increased efficiency, a significant lower cost per hire and enhanced customer- and candidate satisfaction.”
Peter Segerius, Senior Online Marketer at YoungCapital

General

The idea is to have a granulated understanding of data, and statistical impact patterns with the leverage of ML algorithms. This includes analysis of the statistical impact of marketing spend grouped by campaign, job type (job function), location and across different performance marketing channels such as Google Ads, Meta Ads manager, e-mail marketing and so on.

 

Data

The model is trained on data of job openings, job applications and Google Analytics 4. Ingesting more and more data does not necessarily have a positive impact on the model’s accuracy.

There is a general misconception that the more data is used, the better the outcome of the model will be. This is not always the case and we managed to find a sweet spot in the number of months of data to ingest from both business and machine learning perspectives.

The model is then used to predict the number of job applications for the job openings of the current week (i.e. the week of the prediction). The model is retrained every week using the new data of the previous week.

When building any model, data engineering is obviously a fundamental aspect. For the project, we utilise a data set stored in the data warehouse which was built together with Relevant Online over the past few years. The dataset consists of job openings, categories, locations, applications and marketing related info from, among others, Google Analytics 4. It enables tracking of candidates through YoungCapital’s systems, from the initial acquisition channel up until the final moment the candidate signs his or her contract.

Warehousing

When building any model, data engineering is obviously a fundamental aspect. For the project, we utilise a data set stored in the data warehouse which was built together with Relevant Online over the past few years. The dataset consists of job openings, categories, locations, applications and marketing related info from, among others, Google Analytics 4. It enables tracking of candidates through YoungCapital’s systems, from the initial acquisition channel up until the final moment the candidate signs his or her contract.

TRAINING THE MODEL

For training the model (Decision-making in ML infrastructure), we don’t use all of the features of the data set. Instead, we extract the features that are most relevant for the prediction of the number of job applications. This so-called “feature extraction” implies adjusting the data in a manner that provides better prediction results.

For the sake of transparency and explainability, Relevant Online and YoungCapital prefer a method, which enables tracing back which part of the data has affected the outcome of the machine learning product. Unlike some known and fancy neural networks, which are basically black box solutions, traceability and transparency of the ML are key factors continuously guiding Relevant Online’s development.

If you want to learn in more detail about the (open source) model, the metric and the algorithm which are used, you can read further after the results and conclusions of this case study.

 

BUSINESS IMPLICATIONS AND COSTS

Building and running a predictive model like the Predictive Automated Matchmaking Model has business implications and accompanying costs because it requires:

  • cloud infrastructure on Google Cloud Platform (GCP);
  • space for storing data in BigQuery and storing `metadata` of the pipeline process in Google Cloud Storage (GCS);
  • hosting for the `Dataproc` cluster for data preprocessing training the model;
  • time spent for maintenance of the pipeline, testing and adding new features.

Results and conclusion

Business value

The business value of the model stretches further than just as a candidate acquisition channel. The model also spots new chances in the market regarding scarcity of candidate profiles in the database, availability of matching job openings, difficulty of finding such profiles for the available jobs and the final revenue gained.

Besides the marketing department, many facets of the company benefit from the model. The allocation of ad spend, and how channels and candidates match, is also valuable information for the sales- and account teams in order to provide valuable candidate profile scarcity information to existing and potential customers.

In short, it offers the right candidates at the right time for the recruitment teams, improved allocation and understanding of marketing investments, insights for sales and national account teams about opportunities in the market, more information about pricing and the difficulty of finding profiles to take to customer negotiations, and a better understanding of financial drivers for marketing expenses for the finance department.

The accuracy of the models’ results, the transparency it offers and the ability to optimise it over multiple advertising channels makes it a worthwhile investment. The model will continue to help achieve the best possible ad spent and cost per hire, now and in the future.

about young capital

YoungCapital is a recruitment agency with over 20,000 candidates at work every week. YoungCapital connects the new generation of employees with employers by managing the recruitment and selection process, as well as handling contracts and salary payments. It has the largest database of young people in Europe and its growth has been unstoppable in recent years.

about relevant online

Relevant Online is a Data Tracking, Data Engineering and Data Science agency helping its customers to make the most out  of their data. In an ever evolving industry Relevant Online develops and optimises (data) architecture, tracking, dashboarding and builds (predictive) models to fit their clients’ needs.

Case study YoungCapital

Youngcapital and Relevant Online utilise the power of AI.
Driving recruitment efficiency: predictive matchmaking model. 

Challenges in online recruitment

YoungCapital invests heavily in online advertising efforts with two main goals: to find new talent as well as employers looking to fill vacancies. Matching the demand and supply of the right candidates to open vacancies is a huge challenge, as it is very labor-intensive. 

Manual allocation of the marketing budgets (across various online channels, regions and ad campaigns) by marketing specialists in an efficient way is extremely difficult. The factors influencing increases or decreases in demands for vacancies are very hard to guess. Furthermore, due to the sheer amount of different vacancies there always is a compromise in the amount of campaigns due to limited capacity and hours the marketing team has available.

Intuition alone is not sufficient to navigate the complexities of marketing platforms and audience behaviour. Different networks are trying to solve this challenge with their own tools and models like for example Google with “Performance Max”. Such tools all have the same limitations; they are black boxes and they are built only to optimise for that specific network.

Other programmatic SAAS solutions also don’t offer enough depth in matching based on the right amount of candidates seeked at any given time. Most only run based on the amount of budget put in, not the actual amount the employers seek.

YoungCapital and Relevant Online therefore defined the goal to develop a model which should predict the best match between candidates’ and clients’ (the employers) needs. The Predictive Automated Matchmaking Model intends to find the most cost-efficient channel to match candidates with employers at the lowest possible “cost per hire”.

Approach

“By combining YoungCapital’s knowledge of recruitment with Relevant Online’s AI expertise, the coöperation revolutionises the procurement of candidates, resulting in increased efficiency, a significant lower cost per hire and enhanced customer- and candidate satisfaction.”

Peter Segerius, Senior Online Marketer at YoungCapital

General

The idea is to have a granulated understanding of data, and statistical impact patterns with the leverage of ML algorithms. This includes analysis of the statistical impact of marketing spend grouped by campaign, job type (job function), location and across different performance marketing channels such as Google Ads, Meta Ads manager, e-mail marketing and so on.

Data

The model is trained on data of job openings, job applications and Google Analytics 4. Ingesting more and more data does not necessarily have a positive impact on the model’s accuracy.

There is a general misconception that the more data is used, the better the outcome of the model will be. This is not always the case and we managed to find a sweet spot in the number of months of data to ingest from both business and machine learning perspectives.

The model is then used to predict the number of job applications for the job openings of the current week (i.e. the week of the prediction). The model is retrained every week using the new data of the previous week.

When building any model, data engineering is obviously a fundamental aspect. For the project, we utilise a data set stored in the data warehouse which was built together with Relevant Online over the past few years. The dataset consists of job openings, categories, locations, applications and marketing related info from, among others, Google Analytics 4. It enables tracking of candidates through YoungCapital’s systems, from the initial acquisition channel up until the final moment the candidate signs his or her contract.

Training the model

For training the model (Decision-making in ML infrastructure), we don’t use all of the features of the data set. Instead, we extract the features that are most relevant for the prediction of the number of job applications. This so-called “feature extraction” implies adjusting the data in a manner that provides better prediction results.

For the sake of transparency and explainability, Relevant Online and YoungCapital prefer a method, which enables tracing back which part of the data has affected the outcome of the machine learning product. Unlike some known and fancy neural networks, which are basically black box solutions, traceability and transparency of the ML are key factors continuously guiding Relevant Online’s development.

If you want to learn in more detail about the (open source) model, the metric and the algorithm which are used, you can read further after the results and conclusions of this case study.

Business implications and costs

Building and running a predictive model like the Predictive Automated Matchmaking Model has business implications and accompanying costs because it requires:

  • cloud infrastructure on Google Cloud Platform (GCP);
  • space for storing data in BigQuery and storing `metadata` of the pipeline process in Google Cloud Storage (GCS);
  • hosting for the `Dataproc` cluster for data preprocessing training the model;
  • time spent for maintenance of the pipeline, testing and adding new features.

Results and conclusion

Business value

The business value of the model stretches further than just as a candidate acquisition channel. The model also spots new chances in the market regarding scarcity of candidate profiles in the database, availability of matching job openings, difficulty of finding such profiles for the available jobs and the final revenue gained.

Besides the marketing department, many facets of the company benefit from the model. The allocation of ad spend, and how channels and candidates match, is also valuable information for the sales- and account teams in order to provide valuable candidate profile scarcity information to existing and potential customers.

In short, it offers the right candidates at the right time for the recruitment teams, improved allocation and understanding of marketing investments, insights for sales and national account teams about opportunities in the market, more information about pricing and the difficulty of finding profiles to take to customer negotiations, and a better understanding of financial drivers for marketing expenses for the finance department.

The accuracy of the models’ results, the transparency it offers and the ability to optimise it over multiple advertising channels makes it a worthwhile investment. The model will continue to help achieve the best possible ad spent and cost per hire, now and in the future.

about young capital

YoungCapital is a recruitment agency with over 20,000 candidates at work every week. YoungCapital connects the new generation of employees with employers by managing the recruitment and selection process, as well as handling contracts and salary payments. It has the largest database of young people in Europe and its growth has been unstoppable in recent years.

about relevant online

Relevant Online is a Data Tracking, Data Engineering and Data Science agency helping its customers to make the most out  of their data. In an ever evolving industry Relevant Online develops and optimises (data) architecture, tracking, dashboarding and builds (predictive) models to fit their clients’ needs.