Browsing: reinforcement fine-tuning learning