Large DatasetsΒΆ
Calculating item similarities is computationally heavy, in terms of cpu cycles, amount of RAM and database load.
Some strategy you can use to mitigate it includes:
- Parallelize the precomputing task. This could be achieved by disabling the default task (via
RECOMMENDS_TASK_RUN = False
) and breaking it down to smaller tasks (one per app, or one per model), which will be distributed to different machines using dedicated celery queues.