0g-promptRL

Tech Stack

React
Next
Node

Description

PromptRL is a reinforcement learning project that learns cost-aware LLM configurations. It frames prompt strategy selection as a Q-learning problem and searches for the best combination of model, reasoning mode, and persona at each task difficulty level. Instead of sending every task through the same model and prompt style, PromptRL explores a discrete action space and optimizes output quality relative to inference cost.
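
As a rough illustration of the discrete action space and the cost-aware objective described above, here is a minimal Python sketch. The option lists, the `reward` function, and its `cost_weight` parameter are all hypothetical; the project description does not say which models, modes, or personas are used, nor exactly how quality and cost are combined.

```python
from itertools import product

# Hypothetical option lists: the real models, reasoning modes, and
# personas used by the project are not stated in the description.
MODELS = ["small-model", "large-model"]
REASONING_MODES = ["direct", "step-by-step"]
PERSONAS = ["neutral", "expert"]

# Each action is one (model, reasoning mode, persona) combination.
ACTIONS = list(product(MODELS, REASONING_MODES, PERSONAS))


def reward(quality: float, cost: float, cost_weight: float = 1.0) -> float:
    """Assumed reward shape: quality score minus weighted inference cost.

    A ratio (quality / cost) would be another plausible reading of
    "quality relative to cost"; the subtraction here is an assumption.
    """
    return quality - cost_weight * cost


if __name__ == "__main__":
    print(len(ACTIONS))      # 2 * 2 * 2 = 8 combinations
    print(ACTIONS[0])        # ('small-model', 'direct', 'neutral')
    print(reward(1.0, 0.5))  # 0.5
```

Under this assumed reward, a cheaper configuration wins whenever its quality is close enough to that of a more expensive one, which is the trade-off the description points at.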

Tasks Included

  • easy: short tweet generation

  • medium: constrained LinkedIn post generation

  • hard: beginner-friendly explanation of quantum computing
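
One way to picture how the three difficulty levels could act as Q-learning states is the simulated sketch below. It is not the project's code: the action names, the quality and cost numbers, and the hyperparameters are made up for illustration, and it assumes each task is a one-step episode, so the Q-learning target is just the observed reward with no bootstrapped next-state term.

```python
import random

# States: the three task difficulty levels listed above.
DIFFICULTIES = ["easy", "medium", "hard"]

# Hypothetical stand-ins for (model, reasoning mode, persona) combinations.
ACTIONS = ["cheap", "balanced", "premium"]

# Made-up simulation numbers, not measurements from the project.
COST = {"cheap": 0.1, "balanced": 0.3, "premium": 0.6}
QUALITY = {
    "easy": {"cheap": 0.9, "balanced": 0.9, "premium": 0.9},
    "medium": {"cheap": 0.5, "balanced": 0.85, "premium": 0.9},
    "hard": {"cheap": 0.2, "balanced": 0.5, "premium": 0.95},
}


def simulated_reward(state, action, rng):
    """Assumed reward: quality minus cost, plus a little evaluation noise."""
    return QUALITY[state][action] - COST[action] + rng.gauss(0, 0.02)


def train(episodes=6000, alpha=0.1, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in DIFFICULTIES}
    for _ in range(episodes):
        state = rng.choice(DIFFICULTIES)
        # Epsilon-greedy: explore sometimes, otherwise pick the best-known action.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(q[state], key=q[state].get)
        r = simulated_reward(state, action, rng)
        # One-step episode: the update target is the reward itself.
        q[state][action] += alpha * (r - q[state][action])
    return q


q_table = train()
best = {s: max(q_table[s], key=q_table[s].get) for s in DIFFICULTIES}

if __name__ == "__main__":
    for difficulty, action in best.items():
        print(difficulty, "->", action)
```

With these made-up numbers, the expected reward favors the cheap configuration for easy tasks (0.8 vs 0.6 vs 0.3), the balanced one for medium tasks (0.55), and the premium one for hard tasks (0.35), so the learned policy routes harder tasks to costlier configurations only when the quality gain justifies it.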

Team Leader
Vidip Ghosh
Project Link
Category
AIInfra