Enviar por SMS: Leveraging dynamic masked softmax and shared hidden layers for hierarchical text-based product classification with bert