Now let’s try word2vec on the property details page and see if we can find something interesting.
load libs
1 |
|
The poor rabbit chased by Python and Anaconda :p
If you:
Just for some extra fun, Let’s do some plots to explore the ads dataset a bit
1 |
|
I think there is no solid evidence to prove which is better than another. These two algorithms build from different methods with different hyper-parameters to tune. Therefore, I think the right approach is to understand the pros and cons of the two, recall the pros and cons when solving your specific problem.
When I first started learning Pandas library, I seriously suffered a memory issue:
My brain said:
It’s hard to write the pandas functions into the disk, while your memory is not enough.
In data science data cleaning stage, we may encounter this situation:
In one column you have multiple features, each feature has multiple values, they all stacked in one column.