Multimodal Pretraining for Dense Video Captioning 
Traffic forecasting as a canonical task of multivariate time series
forecasting has been a significant research topic in AI community. To address
the spatio-temporal heterogeneity and non-stationarity implied in the traffic
stream, in this study, we propose Spatio-Temporal Meta-Graph Learning as a
novel Graph Structure Learning mechanism on spatio-temporal data. Specifically,
we implement this idea into Meta-Graph Convolutional Recurrent Network
(MegaCRN) by plugging the Meta-Graph Learner powered by a Meta-Node Bank into
GCRN encoder-decoder. We conduct a comprehensive evaluation on two benchmark
datasets (i.e., METR-LA and PEMS-BAY) and a new large-scale traffic speed
dataset called EXPY-TKY that covers 1843 expressway road links in Tokyo. Our
model outperformed the state-of-the-arts on all three datasets. Besides,
through a series of qualitative evaluations, we demonstrate that our model can
explicitly disentangle the road links and time slots with different patterns
and be robustly adaptive to any anomalous traffic situations. Codes and
datasets are available at https://github.com/deepkashiwa20/MegaCRN.