Rahman, Tahmid; Mahmud, Shahriar; Nasrum, Nur
(Department of Computer Science and Engineering(CSE), Islamic University of Technology(IUT), Board Bazar, Gazipur-1704, Bangladesh, 2024-11-30)
In the realm of Bangla conversational agents, this research endeavors to elevate
the responsiveness of Large Language Models (LLMs) through the synergistic ap plication of Reinforcement Learning from Human Feedback (RLHF) ...