In our previous issue, we revisited the history of deep learning. It was important to first have a foundation before diving deep into the latest trends.This week, we will talk about a recent paper titled Pix2seq: A Language Modeling Framework for Object Detection from Google AI scientists(the paper was published in ICLR 2022). On a high-level note, the paper reveals a simple and elegant way of performing object detection using a natural language approach. Keeping things simple, we will talk about the motivation behind the paper, the architecture, and the results on benchmark datasets.
Neural Networks Should be Able to Read…
In our previous issue, we revisited the history of deep learning. It was important to first have a foundation before diving deep into the latest trends.This week, we will talk about a recent paper titled Pix2seq: A Language Modeling Framework for Object Detection from Google AI scientists(the paper was published in ICLR 2022). On a high-level note, the paper reveals a simple and elegant way of performing object detection using a natural language approach. Keeping things simple, we will talk about the motivation behind the paper, the architecture, and the results on benchmark datasets.