Dmytro Mishkin
Marrying classical CV and Deep Learning. I do things, which work, rather than being novel, but not working.
http://dmytro.ai
- The Illusion of Generalization: Re-examining Tabular Language Model Evaluation Aditya Gorla, Ratish Puduppully tl;dr: @tunguz.bsky.social and other Kaggle GMs were right all along about tabular DL models. arxiv.org/abs/2602.04031
- RayRoPE: Projective Ray Positional Encoding for Multi-view Attention Yu Wu, Minsik Jeon, Jen-Hao Rick Chang, @onceltuzel.bsky.social Shubham Tulsiani tl;dr: even more projective geometry based PE, if you know your camera pose. arxiv.org/abs/2601.15275
- S-MUSt3R: Sliding Multi-view 3D Reconstruction Leonid Antsfeld, Boris Chidlovskii, Yohann Cabon, @vincentleroy.bsky.social Jerome Revaud tl;dr: sliding MUSt3R for cheap long seq The most interesting is alignment: point-based+camera-based. KDTree local desc for loop closure arxiv.org/abs/2602.04517
- CoWTracker: Tracking by Warping instead of Correlation Zihang Lai, Eldar Insafutdinov, Edgar Sucar, Andrea Vedaldi tl;dr: warping head does help, so as using pretrained VGGT as a backbone. arxiv.org/abs/2602.04877
- Wid3R: Wide Field-of-View 3D Reconstruction via Camera Model Conditioning Dongki Jung, Jaehoon Choi, Adil Qureshi, Somi Jeong, Dinesh Manocha, Suyong Yeon tl;dr: Pi-3 with (ray+radial distance) point map prediction and wide/fisheye camera tokens. arxiv.org/abs/2602.05321
- Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning Xuejun Zhang, Aditi Tiwari, Zhenhailong Wang, Heng Ji tl;dr: injecting camera poses info into LLM to help answer 3D related questions arxiv.org/abs/2602.06041
- Reposted by Dmytro Mishkin[Not loaded yet]
- Reposted by Dmytro Mishkin[Not loaded yet]
- Reposted by Dmytro Mishkin[Not loaded yet]
- Incrementing your names can do wonders mp3 -- just sound mp4 -> now you have video! mp5 - now you have a machine gun!
- BBoxMaskPose v2: Expanding Mutual Conditioning to 3D Miroslav Purkrabek, Constantin Kolomiiets, Jiri Matas tl;dr: sota in human pose estimations, especially for the hard cases arxiv.org/abs/2601.15200
- CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx Lukáš Picek et 16 al. tl;dr; pose estimation, but for Lynx. 39k labeled images from 2009 to 2025. Synthetic renders as a bonus. arxiv.org/abs/2506.04931
- XRefine: Attention-Guided Keypoint Match Refinement Jan Fabian Schmid, Annika Hagemann tl;dr: pairwise kpt2subpix with self-attention and study how kpt accuracy influences the camera pose accuracy arxiv.org/abs/2601.12530
- Reposted by Dmytro Mishkin[Not loaded yet]
- Reposted by Dmytro MishkinI have nothing personal with Walter. On the contrary, I honestly thank him for all his service for science! The following is a complaint for our community as a whole. This PAMI-TC newsletter clearly shows that we are doing business as usual, as if nothing is happening in the world right now. 1/3
- The February issue of the PAMI-TC newsletter is out: tinyurl.com/4w4d6xjk Inside is a call for motions for #WACV2026, a few open calls for #CVPR2026, and a bunch of other misc. vision items to consider.
- Modern monetization strategy of mobile games is evil. Remember classic Lemmings? It was an amazing 1991 game. Later, in 2018 there was AMAZING Android port Revisited Lemmings. It was free, but I am fine paying $10 for it. No! Today you have - spinwheels, limited lives trash. And design is bad.
- Recently I have bought, but haven't really read (just chapter 1) the Source Code book. I guess I just toss it away now?
- Reposted by Dmytro Mishkin[Not loaded yet]
- Reposted by Dmytro Mishkin[Not loaded yet]
- Vega plots and confusion tables in wandb are such a useless scam :( Like, who needs to have "summary" confusion table, I need it per val step. matplotlib -> log as image, here we go again.
- Reposted by Dmytro MishkinIf Bilbo had Chat GPT
- Reposted by Dmytro Mishkin[Not loaded yet]
- ChatGPT has a sense of humor.
- git commit -m "why am I so stupid"
- Reposted by Dmytro Mishkin[Not loaded yet]
- I am reading now the reviews by fellow reviewers (I didn't submit for CVPR) in my batch, and, OMG, I see so many ChatGPT reviews.