This is my school project. It focuses on reinforcement learning for personalized news recommendation. Its main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library of SOTA reinforcement learning algorithms for recommendation that provides whatever level of abstraction you like.

Features

  • You can import an entire algorithm (say, DDPG) and call ddpg.learn(batch); import the networks and the learning function separately; create a custom loader for your task; or define everything yourself
  • Examples do not contain any junk code or workarounds: just the model definition and the algorithm itself in one file. I wrote a couple of articles explaining how it works
  • Documentation is available
  • Learning is built around a sequential or frame environment that supports ML20M and similar datasets
  • Seq and Frame determine how the length of sequential data is handled: Seq is fully sequential with dynamic size (WIP), while Frame is a static, fixed-size frame
  • State Representation module with various methods. For sequential state representation, you can use LSTM/RNN/GRU (WIP)
  • Parallel data loading with Modin (Dask / Ray) and caching
  • PyTorch 1.7 support with TensorBoard visualization
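
As a rough illustration of the Seq/Frame distinction above, here is a minimal sketch in plain Python. It does not use RecNN's own API: the function names frame_samples and seq_samples, the (state, action) pairing, and the item ids are all hypothetical, chosen only to show how a static frame differs from a growing sequence.

```python
from typing import List, Sequence, Tuple

Sample = Tuple[List[int], int]  # (state: past item ids, action: next item id)


def frame_samples(history: Sequence[int], frame_size: int) -> List[Sample]:
    """Static frame: every state holds exactly `frame_size` past items."""
    samples = []
    for end in range(frame_size, len(history)):
        state = list(history[end - frame_size:end])
        samples.append((state, history[end]))
    return samples


def seq_samples(history: Sequence[int]) -> List[Sample]:
    """Fully sequential: every state is the whole prefix so far, so its length grows."""
    return [(list(history[:end]), history[end]) for end in range(1, len(history))]


user_history = [10, 20, 30, 40, 50]  # hypothetical item ids, oldest first

print(frame_samples(user_history, frame_size=3))
# [([10, 20, 30], 40), ([20, 30, 40], 50)]
print(seq_samples(user_history))
# [([10], 20), ([10, 20], 30), ([10, 20, 30], 40), ([10, 20, 30, 40], 50)]
```

Fixed-size frames batch trivially because every state has the same shape, while the sequential variant needs padding or a recurrent encoder, which is presumably why the latter pairs with the LSTM/RNN/GRU state representation mentioned above.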

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python Libraries, Python Reinforcement Learning Frameworks, Python Reinforcement Learning Libraries, Python Reinforcement Learning Algorithms

Registered

2024-06-04