top of page
  • Improved video captioning based on Show-and-Tell model.

    • image tags

    • audio feature (MFCC)

    • spatial attention

    • GAN (novel)

  • Attended TREC Video Retrieval Evaluation (TRECVID) 2017 ‘Video to Text’ (VTT) competition held by NIST.

 Exploring CNN-LSTM Architectures for Video Captioning

Project @ CMU 11775 Large-Scale Multimedia Analysis 

Mar 2017 - May 2017

all-fail-2
Screen Shot 2017-09-22 at 2.11.41 AM
Screen Shot 2017-09-22 at 2.13.23 AM
Screen Shot 2017-09-22 at 2.15.07 AM

© 2023 by Sasha Blake. Proudly created with Wix.com

bottom of page