THE CLASSIFICATION OF OCCUPATIONS FOR ONLINE JOB ADVERTISEMENTS CHALLENGE - The second round of the European Statistics Awards for Web Intelligence
Online job advertisements contain various types of information including a job description, information about the company looking to hire, job benefits, requirements for job seekers, etc. In order to calculate meaningful statistics given the data collection method and size of the online job advertisements datasets, occupational class labels must be provided for these various entries. Within the WI CLASSIFICATION CHALLENGE, teams will compete using advanced modelling techniques to develop an efficient and robust automated solution for correctly assigning class labels.
The second round of European Statistics Awards for Web Intelligence will begin in June 2024 with registrations open until 15 July 2024.
Timeline
The competition will begin on 1 June 2024 and will run for four months until 30 September 2024. The deadline for registration is 15 July 2024.
Awards
Accuracy Award
First Prize EUR 10 000
Second Prize EUR 5 000
Third Prize EUR 3 000
Reusability Award
First Prize EUR 10 000
Second Prize EUR 5 000
Third Prize EUR 3 000
Innovativity Award
First Prize EUR 5 000
Second Prize EUR 3 000
Third Prize EUR 1 000
Teams
Teams comprising a maximum of five individuals with diverse backgrounds and expertise in programming and web intelligence are eligible to participate in the competition. This contest presents an exceptional chance to apply your understanding of classification modelling in an actual context and potentially receive up to EUR 10 000 for developing the most accurate model. If your team secures the top spot for all three awards, you could earn up to EUR 25 000 in this round.
Find out moreThe Web Intelligence - Deduplication Challenge
The winners of the web intelligence - deduplication challenge have been announced
A part of the European Statistics Awards Program aims at stimulating innovation in the area of Web Intelligence for European statistics, focusing on identifying potential duplicate job postings on websites as a basic condition to produce high quality statistics from online job advertisements.
Find out moreFrequently asked questions
I am having trouble uploading my submission. The submit button is grey or turns grey after clicking the submit button and selecting the appropriate zip file.
Recently, a technical issue has appeared associated with Codalab and the use of certain browsers. We advise teams to use Firefox instead of Edge or Chrome which appears to solve the problem.
Under which legal system is the NDA signed?
As a rule, EU law applies. The implementation of the terms of use shall be governed by Luxembourg law; the courts in Luxembourg shall have sole jurisdiction to hear any disputes.
In the event of a dispute (eg. breach of NDA) the Commission can take action by filing a complaint or by reporting the breach to the police on the basis of national legislation.
Is it permissible to use the eTranslation tool from the European Commission?
The current NDA doesn't foresee such a possibility and using eTranslation would mean losing control over data thus breaking the provisions of the NDA.
Moreover, the eTranslation service is not available to everybody so it would disrupt the level playing field for the other competitors.
Does the question regarding data security issues and the use of the ETRANSLATION tool extend to using other 3rd party APIs, for instance, Google Translate, OpenAI?
Yes, it does. Sending the job advertisement text to third-party API servers makes it accessible to those third parties, which violates the terms of the NDA.
Considering the terms of the NDA, are teams expected to develop their solutions locally on their own machines, or does the possible restriction on third party APIs extend to spinning up remote GPU machines for model training?
No, you don't have to develop your solution locally. You can use cloud infrastructure as long as access to the data is restricted to those who have signed the NDA. This means you are responsible for ensuring the security of the cloud resources you use. You must control who can access the data and ensure that data transmission between your local machines and the cloud is secure, such as through encryption.
Our team has made 2 failed and 1 successful submission. The performance ranking states that we've made 3 submission. Are we still able to make 9 successful submissions?
Failed submissions DO NOT count towards the submission limits. Only valid, successful submissions are counted.
We will periodically make corrections on the performance ranking page and adjust for failed attempts.
Even if the performance ranking currently indicates the total number of submissions which includes failed attempts, we will check the total number of VALID submissions during the evaluation phase and disregard all FAILED attempts, ensuring that each team is allowed 10 VALID submissions.