qywu.github.io - Qingyang's Log

Example domain paragraphs

This tutorial covers how to setup a cluster of GPU instances on AWS and use Slurm to train neural networks with distributed data paralleli... Jan 22, 2020 Importance-Aware Learning for Neural Headline Editing Many social media news writers are not professionally trained. Therefore, social media platforms have to hire professional editors to adjust amateur headlines to attract more readers. We aim to automate the headline editing process to ... Jun 23, 2019 Notes on CVPR 2019 This is a note of thoughts and s

This is a practical analysis of how Gradient-Checkpointing is implemented in Pytorch, and how to use it in Transformer models like BERT and GPT2.

Links to qywu.github.io (8)