Skip to content

Image Captioning¤

Coming Soon

This example is planned for a future release. Check back for updates on image captioning implementations.

Overview¤

This example will demonstrate:

  • Image-to-text generation
  • Attention-based captioning models
  • Beam search decoding
  • Caption quality evaluation (BLEU, CIDEr)

Planned Features¤

  • CNN + RNN caption generation
  • Transformer-based captioning
  • Visual attention mechanisms
  • Dense captioning for regions

References¤

  • Xu et al., "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention" (2015)
  • Vinyals et al., "Show and Tell: A Neural Image Caption Generator" (2015)