Paper Reading List (After Oct., 2023)

2023-11-12 0 0 Comments Word Count: 627(words) Read Count: 3(minutes)

论文阅读（Papers）

Title	Keyword	Venue	Link
Learning Multi-Scene Absolute Pose Regression with Transformers	Camera Localization; Transformer; Multi-Scene APR	ICCV'21	[Paper] [Code]
Image-Based Localization Using Hourglass Networks	Camera Localization; Encoder-Decoder CNN;	ICCVW'17	[Paper]
Geometric Loss Functions for Camera Pose Regression with Deep Learning	Geometric Reprojection; Homoscedastic Uncertainty; Camera Localization; CNN	CVPR'17	[Paper]
Image-Based Localization Using LSTMs for Structured Feature Correlation	Camera Localization; LSTM; CNN; Large-Scale Indoor Localization	ICCV'17	[Paper][Dataset]
Sequential Modeling Enables Scalable Learning for Large Vision Models	Large Vision Model (LVM); Unified Vision Dataset (UVD); Auto-Regressive; Pretrained; Transformer	arXiv'23	[Paper] [Code] [Dataset] [Project]
Learning to Detect Scene Landmarks for Camera Localization	Camera Localization; Scene Landmarks; CNN; Indoor-6	CVPR'22	[Paper] [Code] [Dataset]
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions	GPT-4V; Large Multi-modal Models (LMMs); Pre-training; Image Captioning	arXiv'23	[Paper] [Website]
From Coarse to Fine: Robust Hierarchical Localization at Large Scale	Camera Localization; 6-DoF; SfM; CNN; Hierarchical Framework	CVPR'19	[Paper] [Code]
NavMarkAR: A Landmark-based Augmented Reality (AR) Wayfinding System for Enhancing Spatial Learning of Older Adults	HCI; Augmented Reality (AR); Indoor Navigation; Multimodal Interaction; Older Adults	arXiv'23	[Paper]
SparsePose: Sparse-View Camera Pose Regression and Refinement	Camera Pose Regression; Sparse Set; Wide-Baseline Images	CVPR'23	[Paper] [Code]
Image Captioning State-of-the-Art: Is it enough for the Guidance of Visually Impaired in an Environment?	Image Captioning; VIPs	CSA'22	[Paper]
Grounding Answers for Visual Questions Asked by Visually Impaired People	Visual Question Answering (VQA); Answer Grounding; VIPs; Viz-Wiz-VQA-Grounding	CVPR'22	[Paper] [Dataset]
Structure-from-Motion Revisited	SfM; 3D Reconstruction; COLMAP	CVPR'16	[Paper] [Code (COLMAP)]
ViNav: A Vision-Based Indoor Navigation System for Smartphones	Indoor Navigation System; Smartphone System; SfM; OCR; Dead Reckoning; WiFi Fingerprinting	TMC'18	[Paper]
Understanding the Limitations of CNN-based Absolute Camera Pose Regression	PoseNet; MapNet; Image Retrieval; Absolute Camera pose Regression (APR); Relative camera Pose Regression (RPR)	CVPR'19	[Paper] [Code]
A Preliminary Study on the Possibility of Scene Captioning Model Integration as an Improvement in Assisted Navigation for Visually Impaired Users	Visually Impaired (VI); NAS; Image Captioning; RGB-D	AsiaSim'23	[Paper]
ClipCap: CLIP Prefix for Image Captioning	Image Captioning; Prefix; CLIP; GPT; VLP;	arXiv'21	[Paper] [Code]
"I Want to Figure Things Out": Supporting Exploration in Navigation for People with Visual Impairments	Human Computer Interaction (HCI); Visually Impaired People (VIPs); Navigation Assistance Systems (NASs); Accessibility	PACMHCI'23	[Paper]
Segment Anything	Promptable Segmentation; PFM; SA-1B	ICCV'23	[Paper] [Code] [Website]
Camera Pose Auto-Encoders for Improving Pose Regression	Camera Pose Regression; Auto-Encoders; Image Reconstruction	ECCV'22	[Paper] [Code]
PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization	Camera Pose Regression; 6-DoF; CNN; Cambridge Landmarks	ICCV'15	[Paper] [Code] [Dataset]
Real-time Vision-based Navigation for a Robot in an Indoor Environment	SAM; Obstacle Avodiance; BEV; Path Planning; A* Algorithm	arXiv'23	[Paper] [Code] [Appendix] [Dataset] [Video]
IBeaconMap: Automated Indoor Space Representation for Beacon-Based Wayfinding	Indoor Navigation; Beacon Planning; Floor Plan	ICCHP'20	[Paper]

综述阅读（Surveys）

Title	Keyword	Venue	Link
Vision-language navigation: a survey and taxonomy	Vision-and-Language Navigation (VLN); Taxonomy	NCA'23	[Paper]
Indoor Navigation Systems for Visually Impaired Persons: Mapping the Features of Existing Technologies to User Needs	VIPs; Indoor Navigation System; Sensor; Assisstive Device; Meta Analysis	Sensors'20	[Paper]
A Critical Analysis of Image-based Camera Pose Estimation Techniques	Camera Pose Regression; Structure-Based Localization; Absolute/Relative Pose Regression	arXiv'22	[Paper]

基准测试（Benchmarking）

Title	Keyword	Venue	Link
Fiducial Markers for Pose Estimation: Overview, Applications and Experimental Comparison of the ARTag, AprilTag, ArUco and STag Markers	Fiducial Markers; Pose Estimation; Localization	JINT'21	[Paper]

本文链接： https://junfish.github.io/Paper Reading List/index.html

版权声明： 本博客所有文章除特别声明外，均采用 CC BY 4.0 CN协议许可协议。转载请注明出处！

Jason

CREAM

Paper Reading List (After Oct., 2023)

论文阅读（Papers）

综述阅读（Surveys）

基准测试（Benchmarking）

JasonCREAM