Gravar-mail: Multi-Turn LLM Red Teaming via Token Steering and RL-Trained Policy