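Course contents: the camera imaging model (pinhole imaging, camera imaging parameters); monocular vision localization (ranging) methods, their limitations and difficulties; the principles of binocular vision methods.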

Presentation transcript:


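"Wu" (午) is the location of the pinhole. This passage shows that a pinhole forms an inverted image: the light rays cross at a single point (the "duan", 端) at the pinhole. The size of the image depends on the position of this crossing point.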

In general, a single camera cannot correctly recover depth information!

In special cases a single camera can compute depth, for example when the X or Y coordinate of the point in space is known.

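A simple way to compute the ball's position. [Figure: image plane at focal length f; image point (x, y); point in space (X, Y, Z).] If the Y coordinate of the point in space is known, its distance Z can be computed from the pinhole relation $y = fY/Z$, that is, $Z = fY/y$.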

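A simple way to compute position. [Figure: image point (x, y); point in space (X, Y, Z).] Take Y to be the robot's height (or the actually measured camera height); y can be read from the image and f can be obtained by calibration, so $Z = fY/y$, and likewise $X = xZ/f$.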
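The relation on this slide can be sketched in a few lines of Python. This is a minimal illustration rather than the course's own code: it assumes the optical axis is parallel to the ground, the focal length is expressed in pixels, and image coordinates are measured from the principal point. The function names and the numbers are hypothetical.

```python
def distance_from_known_height(f_pixels, height_y, y_pixels):
    """Distance Z of a point whose offset Y from the optical axis is known.

    Uses the pinhole relation y = f * Y / Z, so Z = f * Y / y.
    """
    if y_pixels == 0:
        raise ValueError("a point imaged at y = 0 has no finite distance")
    return f_pixels * height_y / y_pixels


def lateral_offset(f_pixels, x_pixels, distance_z):
    """Lateral coordinate X from the same relation, x = f * X / Z."""
    return x_pixels * distance_z / f_pixels


# Illustrative numbers only: f = 800 px, camera 0.5 m above the ground,
# ground point imaged 100 px below and 40 px beside the principal point.
Z = distance_from_known_height(800.0, 0.5, 100.0)
X = lateral_offset(800.0, 40.0, Z)
print(Z, X)
```

With these numbers the point lies about 4 m ahead and 0.2 m to the side.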

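Problems with the monocular method. [Figure: image point (x, y); focal length f; point in space (X, Y, Z).] Y (the height of the camera above the ground) cannot be measured precisely, and the angle between the camera's principal optical axis and the ground cannot be measured. Both make the distance Z inaccurate.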
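A small sketch can show how sensitive the monocular estimate is to these two quantities. It assumes a flat ground and a camera pitched downward by a known angle, so that a ray imaged y pixels below the principal point lies `pitch + atan(y / f)` below the horizontal. The function name and all numbers are hypothetical.

```python
import math


def distance_with_pitch(f_pixels, camera_height, y_pixels, pitch_rad):
    """Ground distance when the optical axis is tilted down by pitch_rad.

    The viewing ray lies atan(y / f) below the optical axis, hence
    pitch_rad + atan(y / f) below the horizontal.
    """
    ray_angle = pitch_rad + math.atan2(y_pixels, f_pixels)
    if ray_angle <= 0:
        raise ValueError("the ray does not hit the ground in front of the camera")
    return camera_height / math.tan(ray_angle)


# Same pixel measurement, three different assumptions about the setup.
z_level = distance_with_pitch(800.0, 0.5, 100.0, 0.0)
z_tilted = distance_with_pitch(800.0, 0.5, 100.0, math.radians(2.0))
z_height_err = distance_with_pitch(800.0, 0.55, 100.0, 0.0)
print(z_level, z_tilted, z_height_err)
```

In this example an unmodeled 2 degree tilt changes the estimate from about 4 m to about 3.1 m, and a 5 cm error in camera height changes it to about 4.4 m.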

Camera calibration

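Coordinate systems. [Figure: image axes x, y and u, v with origin O.] 1. World coordinate system. 2. Camera coordinate system. 3. Image coordinate system. Note: to correct imaging distortion, an ideal image coordinate system and a real image coordinate system are used to describe the coordinates before and after distortion, respectively.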

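The four steps of the camera's optical imaging process: world coordinate system → (rigid-body transformation) → camera coordinate system → (perspective projection) → ideal image coordinate system → (distortion correction) → real image coordinate system → (digitization) → digital image coordinate system. 1. The rigid-body transformation formula, which can also be written in homogeneous-coordinate form.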
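The slide's own formulas did not survive in the transcript. As a reminder, the rigid-body transformation is commonly written as follows; the symbols $R$ (3x3 rotation), $t$ (translation), and the subscripts $w$ (world) and $c$ (camera) are assumed notation, not taken from the slide.

$$
\begin{pmatrix} X_c \\ Y_c \\ Z_c \end{pmatrix}
= R \begin{pmatrix} X_w \\ Y_w \\ Z_w \end{pmatrix} + t,
\qquad
\begin{pmatrix} X_c \\ Y_c \\ Z_c \\ 1 \end{pmatrix}
= \begin{pmatrix} R & t \\ \mathbf{0}^{T} & 1 \end{pmatrix}
\begin{pmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{pmatrix}
$$

The homogeneous form on the right turns the rotation and translation into a single 4x4 matrix product, which is what allows the imaging steps to be chained as matrix multiplications.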

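2. Perspective projection: the lens imaging principle. [Figure: object, lens center O, points A, B, C, image.] Here f = OB is the focal length of the lens, m = OC is the image distance, and n = AO is the object distance, related by the thin-lens equation $1/f = 1/m + 1/n$. In general $n \gg f$, so $m \approx f$, and the lens imaging model can then be approximated by the pinhole model.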

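2. Perspective projection: the pinhole imaging model. [Figure: optical center o.] The model can also be written in homogeneous-coordinate form.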

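2. The central perspective projection model. [Figure: optical center o, focal length f.] The model can also be written in homogeneous-coordinate form.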
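The projection equations themselves are missing from the transcript. A standard form of the central perspective model, with the image plane placed in front of the optical center so that the image is not inverted, is given below; the symbols $(X_c, Y_c, Z_c)$ for camera coordinates and $(x, y)$ for ideal image coordinates are assumed notation.

$$
x = \frac{f X_c}{Z_c}, \qquad y = \frac{f Y_c}{Z_c}
\qquad\Longleftrightarrow\qquad
Z_c \begin{pmatrix} x \\ y \\ 1 \end{pmatrix}
= \begin{pmatrix} f & 0 & 0 & 0 \\ 0 & f & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix}
\begin{pmatrix} X_c \\ Y_c \\ Z_c \\ 1 \end{pmatrix}
$$

In the physical pinhole model the image plane lies behind the pinhole, which only adds a minus sign to both coordinates (the inverted image).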

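3. Distortion correction: radial and tangential distortion. Sources of distortion: radial distortion (radial component only), decentering distortion (radial and tangential components), and thin prism distortion. [Figure: ideal position versus position with distortion; dr: radial distortion; dt: tangential distortion.]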

3. Distortion correction: other distortion types, and the model commonly used today. [Figure: a: barrel distortion; b: pincushion distortion.]
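The slide's formula for the commonly used model is not preserved. The sketch below uses one widely used form, a polynomial radial term plus a two-parameter tangential term applied to normalized image coordinates. The coefficient names `k1`, `k2`, `p1`, `p2` and the function name are assumptions, and this may not be the exact model shown on the slide.

```python
def distort(x, y, k1=0.0, k2=0.0, p1=0.0, p2=0.0):
    """Map ideal normalized coordinates (x, y) to distorted coordinates.

    k1, k2 control radial distortion; p1, p2 control tangential distortion.
    """
    r2 = x * x + y * y
    radial = 1.0 + k1 * r2 + k2 * r2 * r2
    x_d = x * radial + 2.0 * p1 * x * y + p2 * (r2 + 2.0 * x * x)
    y_d = y * radial + p1 * (r2 + 2.0 * y * y) + 2.0 * p2 * x * y
    return x_d, y_d


ideal = (0.4, 0.3)                        # r^2 = 0.25
barrel = distort(*ideal, k1=-0.2)         # negative k1 pulls points inward
pincushion = distort(*ideal, k1=0.2)      # positive k1 pushes points outward
print(barrel, pincushion)
```

With all coefficients at zero the mapping is the identity, which is the distortion-free linear model.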

3. Distortion correction: other distortion types (continued)

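4. Image digitization. [Figure: pixel axes U, V and image center C.] The origin of the image physical coordinate system has coordinates $(u_0, v_0)$ in the pixel coordinate system, and each pixel has a physical size $d_x$ and $d_y$ along the two axes. The mapping from physical to pixel coordinates is an affine transformation, which can also be written in homogeneous-coordinate form.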

The camera's intrinsic parameter matrix K

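The linear camera imaging model chains the coordinate systems: world coordinate system → camera coordinate system → image physical coordinate system → image pixel coordinate system. The final result is a direct mapping from the world coordinate system to the image pixel coordinate system. This is the linear imaging model, which ignores distortion.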
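As a rough sketch of this chain, the following Python projects a world point to pixel coordinates using $s\,(u, v, 1)^T = K (R X_w + t)$. The intrinsic values (focal length 800 px, principal point (320, 240)), the identity pose, and the function name `project` are illustrative assumptions.

```python
def project(K, R, t, point_w):
    """Project a world point to pixel coordinates, ignoring lens distortion."""
    # Rigid-body transformation: world frame -> camera frame.
    cam = [sum(R[i][j] * point_w[j] for j in range(3)) + t[i] for i in range(3)]
    if cam[2] <= 0:
        raise ValueError("point is not in front of the camera")
    # Apply the intrinsic matrix, then divide by the homogeneous scale.
    hom = [sum(K[i][j] * cam[j] for j in range(3)) for i in range(3)]
    return hom[0] / hom[2], hom[1] / hom[2]


K = [[800.0, 0.0, 320.0],
     [0.0, 800.0, 240.0],
     [0.0, 0.0, 1.0]]
R = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 0.0]

u, v = project(K, R, t, [0.2, -0.1, 4.0])
print(u, v)
```

Here the point 4 m in front of the camera lands at roughly pixel (360, 220).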

Zhang Zhengyou's planar calibration method: http://research.microsoft.com/en-us/um/people/zhang/Calib/

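Zhang Zhengyou's planar calibration method. Basic principle: the template plane is assumed to lie on the $Z = 0$ plane of the world coordinate system, so that $s\,\tilde{m} = K\,[\,r_1 \;\; r_2 \;\; t\,]\,\tilde{M}$. Here $K$ is the camera's intrinsic parameter matrix, $\tilde{M}$ is the homogeneous coordinate vector of a point on the template plane, $\tilde{m}$ is the homogeneous coordinate vector of the corresponding point projected onto the image plane, and $R = [\,r_1 \;\; r_2 \;\; r_3\,]$ and $t$ are the rotation matrix and translation vector of the camera coordinate system relative to the world coordinate system.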

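Zhang Zhengyou's planar calibration method. By the properties of the rotation matrix, namely $r_1^T r_2 = 0$ and $\|r_1\| = \|r_2\| = 1$, each image yields two basic constraints on the intrinsic parameter matrix. Since the camera has 5 unknown intrinsic parameters, $K$ can be solved for linearly and uniquely once 3 or more images have been captured.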
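The two constraints are not preserved in the transcript. In the usual statement of the method, with the plane-to-image homography written as $H = [\,h_1 \;\; h_2 \;\; h_3\,] = \lambda K [\,r_1 \;\; r_2 \;\; t\,]$ (the symbols $H$, $h_i$ and $\lambda$ are assumed notation), they read:

$$
h_1^{T} K^{-T} K^{-1} h_2 = 0,
\qquad
h_1^{T} K^{-T} K^{-1} h_1 = h_2^{T} K^{-T} K^{-1} h_2
$$

The first follows from $r_1^T r_2 = 0$ and the second from $\|r_1\| = \|r_2\|$, after substituting $r_i = \lambda^{-1} K^{-1} h_i$. The symmetric matrix $K^{-T} K^{-1}$ has 6 entries defined up to scale, and each image contributes 2 linear equations in them, which is why 3 images suffice for the 5 intrinsic parameters.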

Zhang Zhengyou's planar calibration method: the planar template used in Zhang's method.

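Zhang Zhengyou's planar calibration method. Algorithm: (1) print a template and attach it to a planar surface; (2) capture several images of the template from different angles; (3) detect the feature points in the images; (4) solve for the camera's intrinsic and extrinsic parameters; (5) solve for the distortion coefficients; (6) refine the result by optimization.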

http://www.vision.caltech.edu/bouguetj/calib_doc/htmls/example2.html

Other calibration methods: http://www.vision.caltech.edu/bouguetj/calib_doc/htmls/links.html

Multi-view stereo vision (ranging) methods

Review: Pinhole Camera

Review: Perspective Projection

Review: Intrinsic Camera Parameters. [Figure: camera center C with axes X, Y, Z; scene point M; image plane with axes u, v and image point m; focal plane.]

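Review: Extrinsic Parameters. [Figure: camera center C with axes X, Y, Z; world axes X, Y, Z; scene point M; image plane with axes u, v and image point m; focal plane.] The camera frame and the world frame are related by a rigid-body transformation.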

Review: Perspective Projection. Points go to points. Lines go to lines. Planes go to the whole image or to half-planes. Polygons go to polygons. However, parallel lines intersect!
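The last remark can be checked numerically. The sketch below, a minimal illustration with made-up numbers, projects points far along two parallel 3-D lines and compares them with the vanishing point $(f d_x / d_z,\; f d_y / d_z)$ of their common direction $d$. The helper names are hypothetical, and $f = 1$ is assumed.

```python
def project_pinhole(point, f=1.0):
    """Central perspective projection of a camera-frame point."""
    X, Y, Z = point
    return f * X / Z, f * Y / Z


def point_on_line(origin, direction, s):
    """Point origin + s * direction on a 3-D line."""
    return tuple(o + s * d for o, d in zip(origin, direction))


direction = (1.0, 0.0, 2.0)          # shared direction of both lines
line_a = (-1.0, -1.0, 2.0)           # a point on the first line
line_b = (1.0, -1.0, 2.0)            # a point on the second line

# Vanishing point of the direction for f = 1.
vanishing_point = (direction[0] / direction[2], direction[1] / direction[2])

far_a = project_pinhole(point_on_line(line_a, direction, 1e6))
far_b = project_pinhole(point_on_line(line_b, direction, 1e6))
print(vanishing_point, far_a, far_b)
```

Both image points approach the same vanishing point even though the 3-D lines never meet.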

Perspective cues


Recovering 3D from images What cues in the image provide 3D information?

Visual cues: shading. [Image: Merle Norman Cosmetics, Los Angeles]

Visual cues: shading, texture. [Image: The Visual Cliff, by William Vandivert, 1960]

Visual cues: shading, texture, focus. [Image: from The Art of Photography, Canon]

Visual cues: shading, texture, focus, motion. Shape from X, where X = shading, texture, focus, motion, ...

Fundamentals of Stereo Vision A camera model: Models how 3-D scene points are transformed into 2-D image points The pinhole camera: a simple linear model for perspective projection

Fundamentals of Stereo Vision The goal of stereo analysis: The inverse process: From 2-D image coordinates to 3-D scene coordinates Requires images from at least two views

Fundamentals of Stereo Vision 3-D reconstruction


Multi-View Geometry relates: 3D world points, camera centers, camera orientations, camera intrinsic parameters, and image points.

Binocular Stereo. Basic principle: triangulation. It gives the reconstruction as the intersection of two rays, and requires calibration and point correspondence. [Figure labels: scene point, image plane, optical center.]
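Triangulation can be sketched as follows. Because two measured rays rarely intersect exactly, this example uses the midpoint of the shortest segment between them, which is one common choice and not necessarily the method used in the course. The function names and the test geometry are assumptions.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def triangulate_midpoint(o1, d1, o2, d2):
    """Midpoint of the shortest segment between rays o1 + a*d1 and o2 + b*d2."""
    w = [q - p for p, q in zip(o1, o2)]          # vector from o1 to o2
    d11, d22, d12 = dot(d1, d1), dot(d2, d2), dot(d1, d2)
    det = d12 * d12 - d11 * d22                  # zero when the rays are parallel
    if abs(det) < 1e-12:
        raise ValueError("rays are parallel; depth is undefined")
    # Solve the 2x2 system that makes the connecting segment
    # perpendicular to both ray directions.
    a = (d12 * dot(w, d2) - d22 * dot(w, d1)) / det
    b = (d11 * dot(w, d2) - d12 * dot(w, d1)) / det
    p1 = [o + a * d for o, d in zip(o1, d1)]
    p2 = [o + b * d for o, d in zip(o2, d2)]
    return [(u + v) / 2.0 for u, v in zip(p1, p2)]


# Two optical centers 0.2 m apart, both observing the point (0.3, -0.1, 2.0).
left_center, right_center = [0.0, 0.0, 0.0], [0.2, 0.0, 0.0]
left_ray, right_ray = [0.15, -0.05, 1.0], [0.05, -0.05, 1.0]
point = triangulate_midpoint(left_center, left_ray, right_center, right_ray)
print(point)
```

The ray directions come from the calibrated image points, which is why the slide lists calibration and correspondence as prerequisites.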

Stereo: two subproblems. Matching (the hardest): finding corresponding elements in the two images. Reconstruction: establishing 3-D coordinates from the 2-D image correspondences found during matching.

Stereo Constraints. Given p in the left image, where can the corresponding point p’ in the right image be?

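Stereo Constraints. [Figure: scene point M; image plane and focal plane; first camera frame O1 with axes X1, Y1, Z1; second camera axes X2, Y2; image points p and p’; epipolar line; epipole.]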

Epipolar Constraint

From Geometry to Algebra. [Figure: scene point P and its image points p and p’.]

Reconstruction up to a Scale Factor. Assume that the intrinsic parameters of both cameras are known. The essential matrix is known up to a scale factor (for example, estimated with the 8-point algorithm).

From Geometry to Algebra (continued). [Figure: scene point P and its image points p and p’.]

Linear constraint: it should be expressible as a matrix multiplication.

Review: Matrix Form of Cross Product (the result is orthogonal to both operands)

Review: Matrix Form of Cross Product

Matrix Form

The Essential Matrix
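The slide equations are not preserved, so the following sketch fixes one convention as an assumption: the second camera sees $P' = R P + t$, the cross product $t \times v$ is written as the skew-symmetric matrix $[t]_\times v$, and the essential matrix is $E = [t]_\times R$, so that $p'^T E p = 0$ for corresponding normalized image points. All function names and numbers are illustrative.

```python
import math


def skew(t):
    """Matrix [t]x such that [t]x v equals the cross product t x v."""
    tx, ty, tz = t
    return [[0.0, -tz, ty],
            [tz, 0.0, -tx],
            [-ty, tx, 0.0]]


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def essential(R, t):
    """Essential matrix E = [t]x R for the convention P' = R P + t."""
    return matmul(skew(t), R)


angle = math.radians(10.0)                       # rotation about the Y axis
R = [[math.cos(angle), 0.0, math.sin(angle)],
     [0.0, 1.0, 0.0],
     [-math.sin(angle), 0.0, math.cos(angle)]]
t = [-0.2, 0.0, 0.02]
E = essential(R, t)

P = [0.3, -0.1, 2.0]                             # point in the first camera frame
P2 = [a + b for a, b in zip(matvec(R, P), t)]    # same point in the second frame
p = [P[0] / P[2], P[1] / P[2], 1.0]              # normalized image point p
p2 = [P2[0] / P2[2], P2[1] / P2[2], 1.0]         # normalized image point p'

epipolar_line = matvec(E, p)                     # line in the second image
residual = sum(a * b for a, b in zip(p2, epipolar_line))
print(residual)
```

The residual is zero up to rounding, which is the epipolar constraint: p’ must lie on the line `E p`, so the search for a match is one-dimensional.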

Reconstruction. [Figure: optical centers O and O’; image points p and p’.]

Reconstruction: Equation 1, Equation 2, and the result obtained from Equations 1 and 2.

Stereo image rectification. Image reprojection: reproject the image planes onto a common plane parallel to the line between the optical centers. A homography (3x3 transform) is applied to both input images, and pixel motion is horizontal after this transformation. Reference: C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

Image Rectification. Common image plane; parallel epipolar lines; search for correspondences along the scan line.

A Simple Stereo System. [Figure: left camera (left image: reference) and right camera (right image: target), separated by the baseline; elevation Zw, with Zw = 0 as the reference level; depth Z; disparity.] (Bahadir K. Gunturk)

Stereo View. [Figure panels: left view, right view, disparity.]

Stereo Disparity. The separation between the positions of two matching objects in the left and right images is called the stereo disparity.

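Parallel Cameras. [Figure: scene point P at depth Z; optical centers Ol and Or; image points pl and pr with coordinates xl and xr; focal length f.] Disparity: $d = x_l - x_r$, which gives the depth $Z = fT/d$, where T is the stereo baseline.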
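For parallel cameras, similar triangles give $Z = fT/(x_l - x_r)$. A minimal sketch with illustrative numbers follows, assuming f and the image coordinates are in pixels and T is in meters; the function name is hypothetical.

```python
def depth_from_disparity(f_pixels, baseline, x_left, x_right):
    """Depth of a point seen by two parallel cameras: Z = f * T / (xl - xr)."""
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the rig")
    return f_pixels * baseline / disparity


near = depth_from_disparity(800.0, 0.1, 50.0, 30.0)   # disparity 20 px
far = depth_from_disparity(800.0, 0.1, 50.0, 45.0)    # disparity 5 px
print(near, far)
```

Depth is inversely proportional to disparity, so a one-pixel matching error matters much more for distant points than for near ones.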

Correlation Approach (left image). For each point (xl, yl) in the left image, define a window centered at the point. Notes: (0) the essential equation actually represents the epipolar plane, seen in either the left or the right image; (1) the epipolar line in the right image given pl is (E pl)^T pr = 0, with pr = (xr, yr, fr), i.e. zr = fr; (2) the epipolar line in the left image given pr is (pr^T E) pl = 0, with zl = fl. (Bahadir K. Gunturk)

Correlation Approach (right image). For the window around (xl, yl), search for its corresponding point within a search region in the right image. (Bahadir K. Gunturk)

Correlation Approach (right image). The disparity (dx, dy) is the displacement from (xl, yl) to (xr, yr) at which the correlation is maximum. (Bahadir K. Gunturk)

Comparing Windows. Does window f in one image match window g in the other? Two common criteria: minimize the sum of squared differences, or maximize the cross-correlation.
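The two criteria are closely related. Writing the windows as $f(i,j)$ and $g(i,j)$ over the same set of offsets (the index notation is assumed here, since the slide's formulas are not preserved):

$$
\mathrm{SSD}(f, g) = \sum_{i,j} \bigl(f(i,j) - g(i,j)\bigr)^2
= \sum_{i,j} f(i,j)^2 + \sum_{i,j} g(i,j)^2 - 2 \sum_{i,j} f(i,j)\, g(i,j),
\qquad
C(f, g) = \sum_{i,j} f(i,j)\, g(i,j)
$$

When the window energies $\sum f^2$ and $\sum g^2$ are roughly constant across candidate positions, minimizing the SSD is equivalent to maximizing the cross-correlation $C$. When they are not, a normalized correlation is usually preferred.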

Correspondence using Discrete Search

Sum of Squared Differences (SSD)
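A discrete SSD search along a scan line can be sketched as below. It assumes rectified images stored as lists of rows, so the match for column c in the left image lies at column c - d in the right image for some disparity d. The synthetic images, the window size, and the function names are all made up for illustration.

```python
import random


def ssd(left, right, row, col_left, col_right, half):
    """Sum of squared differences between two (2*half+1)^2 windows."""
    total = 0
    for dr in range(-half, half + 1):
        for dc in range(-half, half + 1):
            diff = left[row + dr][col_left + dc] - right[row + dr][col_right + dc]
            total += diff * diff
    return total


def best_disparity(left, right, row, col, half, max_disp):
    """Try disparities 0..max_disp on the same scan line; keep the lowest SSD."""
    best_d, best_cost = 0, None
    for d in range(max_disp + 1):
        col_right = col - d
        if col_right - half < 0:          # window would leave the image
            break
        cost = ssd(left, right, row, col, col_right, half)
        if best_cost is None or cost < best_cost:
            best_d, best_cost = d, cost
    return best_d, best_cost


# Synthetic test pair: random texture, right image shifted by 3 pixels.
rng = random.Random(0)
height, width, true_disp = 9, 40, 3
left = [[rng.randrange(256) for _ in range(width)] for _ in range(height)]
right = [[left[r][c + true_disp] if c + true_disp < width else 0
          for c in range(width)] for r in range(height)]

d, cost = best_disparity(left, right, row=4, col=20, half=2, max_disp=8)
print(d, cost)
```

On real images the minimum is rarely this clean: low-texture regions and occlusions produce flat or misleading cost curves, which motivates the additional constraints discussed later.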

Feature-based correspondence. Features most commonly used: corners, with similarity measured in terms of the surrounding gray values (SSD, cross-correlation) and location; edges and lines, with similarity measured in terms of orientation, contrast, the coordinates of the edge or line's midpoint, and the length of the line.

Feature-based Approach (left image). [Figure: corner, line, and structure features.] For each feature in the left image...

Feature-based Approach (right image). [Figure: corner, line, and structure features.] ...search in the right image. The disparity (dx, dy) is the displacement at which the similarity measure is maximum.

Correspondence Difficulties Why is the correspondence problem difficult? Some points in each image will have no corresponding points in the other image. (1) the cameras might have different fields of view. (2) due to occlusion. A stereo system must be able to determine the image parts that should not be matched.

Stereo results. Data from the University of Tsukuba. [Figure panels: scene, ground truth.] (Seitz)

Results with window correlation. [Figure panels: estimated depth map, ground truth.] (Seitz)

Results with a better method. A state-of-the-art method: Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999. [Figure panels: result, ground truth.] (Seitz)

Other constraints. Additional constraints can be imposed on the matching, for example smoothness (disparity usually does not change too quickly).

Reconstruction up to a Scale Factor. Assume that the intrinsic parameters of both cameras are known. The essential matrix is known up to a scale factor (for example, estimated with the 8-point algorithm).

Reconstruction up to a Scale Factor

Reconstruction up to a Scale Factor. A set of auxiliary vectors w is defined, from which it can be proved that the rotation matrix R can be computed.

Reconstruction up to a Scale Factor. Because of the sign ambiguity there are two choices of t (t+ and t-) and two choices of E (E+ and E-). This gives four pairs of translation vectors and rotation matrices.

Reconstruction up to a Scale Factor. Given E and t: (1) construct the vectors w and compute R; (2) reconstruct Z and Z’ for each point; (3) check the signs of Z and Z’ of the reconstructed points: if they are both negative for some point, change the sign of t and go to step 2; if they are different for some point, change the sign of each entry of E and go to step 1; if they are both positive for all points, exit.