10 Best Face Datasets For Facial Recognition Projects
January 20, 2021

Facial recognition, one of the major research areas, has been adopted by organizations and governments for a few years now. Smartphone makers like Apple, Xiaomi, Huawei, Samsung, OPPO, Realme,among others, have been integrating this technology into their phones for providing maximum security to the users.

According to the study, facial recognition market size is expected to grow from USD 3.8 billion in 2020 to USD 8.5 billion by 2025, at a CAGR of 17.2% during the forecast period.

It takes a human 0.2 seconds to recognize a specific face,and most people can recognize about 5,000 faces. We also interpret facial expressions and detect emotions automatically. In other words, we’re naturally good at facial recognition and analysis.

In recent years, Computer Vision (CV) has been catching up and in some cases outperforming humans in facial recognition. Advancing CV and Machine Learning have created solutions that can handle tasks more efficiently and accurately than humans. (source:Forbes 20 January 2021.)

While there are so many databases in use currently, the choice of appropriate databases are so important that should be made based on the task given (emotions, aging, expressions, lighting etc).

In order to help researchers looking for the suitable datasets for their needs, we provide 9 datasets focused on human faces which are popular and high-quality. We’ll list some key characteristics and strengths and weaknesses of each. 

1、CASIA-SURF HiFi Mask Data-set

Publication – SurfingTech

Released – 2021

Description – This dataset is CVPR 2021 Challenge contains both in-person reality videos and attack videos of each subject. There are totally 75 subjects (25 Asians, 25 Africans, 25 Caucasians). There is one-to-one correspondence between real person and its masks. Total is 62.4K videos

Main Use – Anti-spoofing face recognition

Size – 9T

Identities – 75

Data Gathering Method – Intel Realsense D435

2、Photo Attack Anti-spoofing Facial Dataset

Publication – SurfingTech

Released – 2019

Description – Photo Attack Anti-spoofing Facial Dataset is a large-scale face attributes dataset with 6K people faces, more than 48K videos.

Main Use – Anti-spoofing face recognition

Size – 10T

Identities – 6,000

Data Gathering Method – Intel Realsense SR300

3、Screen/Cloth Attack Anti-spoofing Facial Dataset

Publication – SurfingTech

Released – 2019

Description – Screen/Cloth Attack Anti-spoofing Facial Dataset is a large-scale face attributes dataset with 3K people faces, more than 42K videos.

Main Use – Anti-spoofing face recognition

Size – 15T

Identities – 3,000

Data Gathering Method – Intel Realsense SR300/D435i

4、3D Mask Attack Anti-spoofing Facial Dataset

Publication – SurfingTech

Released – 2019

Description – Screen/Cloth Attack Anti-spoofing Facial Dataset is a face attributes dataset with hundreds people faces, almost 6K videos.

Main Use – Anti-spoofing face recognition

Size – 3T

Identities – 148

Data Gathering Method – Intel Realsense D435i

5、2020 Anti-spoofing Facial Dataset

Publication – SurfingTech

Released – 2020

Description – 2020 Anti-spoofing Facial Dataset is a face attributes dataset with hundreds of people faces, almost 609.2K videos.

Main Use – Anti-spoofing face recognition

Size – 20T

Identities – 1,800

Data Gathering Method – Intel Realsense D435i

6、Multiracial 3D Multi-expression Facial Dataset

Publication – SurfingTech

Released – 2020

Description – Multiracial 3D Multi-expression Facial Dataset is a face attributes dataset with hundreds of people faces, almost 50.4K videos,and cover more than 20 countries and different ethnicity.

Main Use – Ai training

Size – 2T

Identities – 8,400

Data Gathering Method – Intel Realsense D435

7、3D body scan Dataset

Publication – SurfingTech

Released – 2021

Description – 3D body scan Dataset is a body attributes dataset with hundreds of people bodies, almost 36K videos,depth data.

Main Use – Ai training

Size – 2T

Identities – 300

Data Gathering Method – Intel Realsense D455

8、African 3D Multi-posture Facial data

Publication – SurfingTech

Released – 2020

Description – African 3D Multi-posture Facial data is a face attributes dataset with hundreds of people faces, almost 18K videos.

Main Use – Ai training

Size – 2T

Identities – 3,000

Data Gathering Method – Intel Realsense SR300

9、Chinese 3D HD Facial Dataset

Publication – SurfingTech

Released – 2020

Description – Chinese 3D HD Facial Dataset is a face attributes dataset with hundreds of super high-definition 3D face expressions data , almost 11K videos.

Main Use – Ai training

Size – 10T

Identities – 850

Data Gathering Method – 3DMD

10、South Asian 3D Multi-expression Facial Dataset

Publication – SurfingTech

Released – 2020

Description – South Asian 3D Multi-expression Facial Dataset is a face attributes dataset with hundreds of face expressions data, almost 12K videos.

Main Use – Ai training

Size – 2T

Identities – 2,000

Data Gathering Method – Intel Realsense SR300