Enviar por SMS: Developing reliability metrics and validation tools for datasets with deep linguistic information