How to set up a Pyspark Session?
I would note that as a Data Scientist, I would not dare to say that you don’t need to understand Data Engineer tasks. Conversely, I also…
Oct 13, 2023

Is your database GDPR proof? How can we use python and NLP tools to check it?
Of course, everybody tries to anonymize sensitive information, although it is not an easy task. Sometimes we make mistakes.
Jul 23, 2021

Spatial join with spark pandas_udf
I am a data scientist. In my work, I encounter huge datasets that contain spatial information, for example coordinates.
Jun 29, 2021