Multi-modal video retrieval using Dilated Pyramidal Residual network