Professional Skills Framework

Mapping the Data Analytics and Data Science Profession

EDISON Data Science Framework

Our diploma and certifications are mapped to the EDISON Data Science Framework (EDSF), which was developed to support, guide and ultimately accelerate the education of fit-for-purpose Data Science professionals.

The EDSF is a collection of documents that define the Data Science profession. Freely available, these documents have been developed to guide educators and trainers, employers and managers, and Data Scientists themselves. Together, they break down the complexity of the skills and competences needed to define Data Science as a professional practice.

The EU-funded EDISON Project has put in place the foundation mechanisms that will speed up the increase in the number of competent and qualified Data Scientists across Europe and beyond.

Defining the Profession

In a rapidly developing profession like Data Analytics and Data Science, it is important for students, professionals and employers to be able to map how each role is defined, where the roles interface and how they interact.

The Professional Skills Framework brings these elements together into one cohesive model that defines the skills, knowledge items and competences required for each role.

Business Analytics Lifecycle

Each professional role defined in the Framework plays an important part in the overall Analytics Lifecycle. At different stages in the lifecycle, some roles become more prominent while others play a supporting part. This balance can change as the project progresses.

Professional Skills Framework

The Professional Skills Framework defines the skills, knowledge items and professional competences required for the most common roles in the Data Analytics and Data Science sector.

Skills
  • Proficiency in at least one of these skills
  • Proficiency in all of these skills
Data Analytics Professional
Data Application Engineer
Data Engineer
Data Scientist
Data Management
Informatica
Ab Initio
Talend
IBM Data Studio
SAS Data Integration
DataFlux
Data Scientist
Analytics
R
IBM SPSS
SAS
Alteryx
Data Scientist
Visualisation
Tableau
Spotfire
Qlik
Power BI
SAS Visual Analytics
Tableau
Spotfire
Qlik
Power BI
SAS Visual Analytics
Languages
Python
SQL
T-SQL (Stored procedures and functions)
PL/SQL
PL/pgSQL
Pig
Hive
Impala
Python
Java
JavaScript (or similar)
Go
Ruby
C
C#
VBA
Databases
SQL Server
PostgreSQL
Teradata
Oracle
IBM DB2
MySQL
SAP HANA
MongoDB
Data Scientist
Shell Scripting
DOS batch
PowerShell
Bash (UNIX/Linux utilities)
Data Scientist
Other tools/software
JIRA
MOVEit
SVN (Subversion)
Git
Monarch
JIRA
MOVEit
SVN (Subversion)
Git
Data Scientist
Techniques and methods
Machine Learning
Predictive Modeling
Big data concepts
Data Warehousing
Big data concepts
Data Warehousing
Data Scientist
Knowledge Bodies
Data Analytics Professional
Data Application Engineer
Data Engineer
Data Scientist
Machine Learning (supervised): Decision trees, Naïve Bayes classification, Ordinary least squares regression, Logistic regression, Neural Networks, SVM (Support Vector Machine), Ensemble methods, others
Systems Engineering and Software Engineering principles, methods and models, distributed systems design and organisation
Data management and enterprise data infrastructure, private and public data storage systems and services
Data Scientist
Machine Learning (unsupervised): clustering algorithms, Principal Components Analysis (PCA), Singular Value Decomposition (SVD), Independent Components Analysis (ICA)
Cloud Computing, cloud-based services and cloud-powered services design
Data storage systems, data archive services, digital libraries, and their operational models
Data Scientist
Machine Learning (reinforcement): Q-Learning, TD-Learning, Genetic Algorithms
Big Data technologies for processing large datasets: batch, parallel and streaming systems, in particular cloud-based ones
Data governance, data governance strategy, Data Management Plan (DMP)
Data Scientist
Data Mining (Text mining, Anomaly detection, regression, time series, classification, feature selection, association, clustering)
Applications software requirements and design, agile development technologies, DevOps and continuous improvement cycle
Data Architecture, data types and data formats, data modeling and design, including related technologies (ETL, OLAP, OLTP, etc.)
Data Scientist
Text Data Mining: statistical methods, NLP, feature selection, Apriori algorithm, etc.
Systems and data security, data access, including data anonymisation, federated access control systems
Data lifecycle and organisational workflow, data provenance and linked data
Data Scientist
Prescriptive Analytics
Compliance-based security models, privacy and IPR protection
Data curation and data quality, data integration and interoperability
Data Scientist
Prescriptive Analytics
Relational and non-relational databases (SQL and NoSQL), Data Warehouse solutions, ETL (Extract, Transform, Load), OLTP and OLAP processes for large datasets
Data protection, backup, privacy, IPR, ethics and responsible data use
Data Scientist
Graph Data Analytics: path analysis, connectivity analysis, community analysis, centrality analysis, subgraph isomorphism, etc.
Big Data infrastructures, high-performance networks, infrastructure and services management and operation
Metadata, PID, data registries, data factories, standards and compliance
Data Scientist
Qualitative analytics
Modeling and simulation, theory and systems
Open Data, Open Science, research data archives/repositories, Open Access, ORCID
Data Scientist
Natural language processing
Information systems, collaborative systems
Data preparation and pre-processing
Data Scientist
Business Analytics (BA) and Business Intelligence (BI); methods and data analysis; cognitive technologies
Optimisation
Data Scientist
Data Warehouse technologies, data integration and analytics
Data-driven User Experience (UX) requirements and design
Competencies
Data Analytics Professional
Data Application Engineer
Data Engineer
Data Scientist
Use appropriate data analytics and statistical techniques on available data to discover new relations, deliver insights into research problems or organizational processes, and support decision-making.
Use engineering principles and modern computer technologies to research, design and implement new data analytics applications; develop experiments, processes, instruments, systems and infrastructures to support data handling during the whole data lifecycle.
Develop and implement a data engineering strategy for data collection, integration, quality, lineage, security, storage, preservation, and availability for further processing.
Data Scientist
Effectively use a variety of data analytics techniques, such as Machine Learning (including supervised, unsupervised and semi-supervised learning), Data Mining, and Prescriptive and Predictive Analytics, for complex data analysis through the whole Business Analytics lifecycle
Use engineering principles (general and software) to research, design, develop and implement new instruments and applications for data collection, storage, analysis and visualisation
Develop and implement a data strategy, in particular in the form of a data management policy and a Data Management Plan, together with a path to executing the plan (tooling and steps)
Data Scientist
Apply designated quantitative techniques, including statistics, time series analysis, optimization, and simulation, to deploy appropriate models for analysis and prediction
Develop and apply computational and data-driven solutions to domain-related problems using a wide range of data analytics platforms, including Big Data technologies for large datasets and cloud-based data analytics platforms
Develop and implement relevant data models, and define metadata using common standards and practices, for different data sources in a variety of scientific and industry domains
Data Scientist
Identify, extract, and pull together available and pertinent heterogeneous data, including modern data sources such as social media data, open data, governmental data
Develop and prototype specialised data analysis applications, tools and supporting infrastructures for data-driven scientific, business or organisational workflows; use distributed, parallel, batch and streaming processing platforms, including online and cloud-based solutions for on-demand provisioned and scalable services
Integrate heterogeneous data from multiple sources and provide it for further analysis and use
Data Scientist
Understand and use different performance and accuracy metrics for model validation in analytics projects, hypothesis testing, and information retrieval in line with the Business Analytics Lifecycle
Develop, deploy and operate large-scale data storage and processing solutions using different distributed and cloud-based platforms for storing data
Maintain historical information on data handling, including references to published data and corresponding data sources (Data Lineage and Data Dictionary)
Data Scientist
Develop the data analytics required for organizational tasks, and integrate data analytics and processing applications into organizational workflows and business processes to enable agile decision making (Stages 5 and 6 of the Business Analytics Lifecycle)
Consistently apply data security mechanisms and controls at each stage of data processing, including data anonymisation, privacy and IPR protection.
Ensure data quality, accessibility, interoperability, compliance with standards, and publication
Data Scientist
Visualise the results of data analysis, design dashboards and use storytelling methods
Design, build and operate relational and non-relational databases (SQL and NoSQL), integrate them with modern data solutions, and ensure effective ETL (Extract, Transform, Load), OLTP and OLAP processes as appropriate to the Data Application being engineered
Design, build and operate effective ETL (Extract, Transform, Load) solutions and processes appropriate to the Data Analysis being performed, such that they can be both implemented and scaled into target environments
Data Scientist
Visualise the results of data analysis, design dashboards and use storytelling methods
Visualise the results of data analysis, design dashboards and use storytelling methods
Data Scientist

Our Certifications

  • Certified Data Scientist

  • Certified Data Applications Engineer

  • Certified Business Data Analyst

  • Certified Data Analyst

  • Certified Data Engineer
