To give machine learning and data science practitioners a larger dataset for model training, the well-known Palmer Penguins dataset has been expanded from its original 344 rows to 100,000 rows. The additional rows are synthetic: they were generated with an adversarial random forest technique, which is intended to preserve the key patterns and features of the original data. The method is reported to have achieved an accuracy of 88%, which suggests the expanded dataset remains realistic and suitable for classification tasks.
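
One simple way to check whether synthetic rows keep the patterns of the original data is to compare class balance and per-class means between the two tables. The sketch below is only an illustration of that kind of check and is not the author's generation or evaluation method. It assumes the expanded file keeps the original column names `species` and `bill_length_mm`; the helper names `species_summary` and `compare` are hypothetical, and the rows are toy values rather than data from either dataset.

```python
from collections import defaultdict
from statistics import mean


def species_summary(rows, column):
    """Return {species: (share_of_rows, mean_of_column)} for a list of dict rows."""
    groups = defaultdict(list)
    for row in rows:
        value = row[column]
        if value in ("", "NA"):
            continue  # skip missing measurements
        groups[row["species"]].append(float(value))
    total = sum(len(values) for values in groups.values())
    return {
        species: (len(values) / total, mean(values))
        for species, values in groups.items()
    }


def compare(original, synthetic, column):
    """Return per-species differences (synthetic minus original) in share and mean."""
    orig = species_summary(original, column)
    synth = species_summary(synthetic, column)
    return {
        species: (
            synth[species][0] - orig[species][0],
            synth[species][1] - orig[species][1],
        )
        for species in orig
        if species in synth
    }


# Toy rows for illustration only; these are NOT values from either dataset.
original_rows = [
    {"species": "Adelie", "bill_length_mm": "39.0"},
    {"species": "Adelie", "bill_length_mm": "41.0"},
    {"species": "Gentoo", "bill_length_mm": "47.0"},
    {"species": "Gentoo", "bill_length_mm": "49.0"},
]
synthetic_rows = [
    {"species": "Adelie", "bill_length_mm": "40.0"},
    {"species": "Adelie", "bill_length_mm": "41.0"},
    {"species": "Adelie", "bill_length_mm": "42.0"},
    {"species": "Gentoo", "bill_length_mm": "48.5"},
]

differences = compare(original_rows, synthetic_rows, "bill_length_mm")
for species, (share_diff, mean_diff) in sorted(differences.items()):
    print(f"{species}: share {share_diff:+.2f}, mean {mean_diff:+.2f} mm")
```

With real files, the two row lists could be read with `csv.DictReader` from the standard library. Small differences in share and mean would be consistent with the synthetic rows following the original distribution, although a check this coarse cannot confirm it.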

[1] Ifeanyi Idiaye, "Palmer Penguins 100k", IEEE Dataport, 2024. [Online]. Available: http://dx.doi.org/10.21227/gh2z-ce85. Accessed: Apr. 19, 2025.