ガボールフィルタ

ガボールフィルタ（英: Gabor filter）は、画像処理のテクスチャー解析等に用いられる線型フィルタの一種。（2次元のガボールフィルタでは）画像の各点周りの局所領域において、方向毎に特定の周波数成分を抽出することができる。虹彩認識や指紋認証にも応用されている他、哺乳類の脳の一次視覚野にある単純型細胞の活動をモデル化できることが示されている。名称はガーボル・デーネシュに因む^[1]。

定義編集

2次元ガボールフィルタのインパルス応答の例

細部が異なる種々の定義があるが、基本的にはガウス関数（ガウシアンエンベロープとも呼ばれる）と三角関数（搬送波とも呼ばれる）の積として定義される^[2]^[3]^[4]： ${\boldsymbol {x}}\in \mathbb {R} ^{n}(n=1,2)$ の函数であるガボールフィルタ $g$ は、 $w$ をガウス関数、 $s$ を三角関数として

$g({\boldsymbol {x}};A,{\boldsymbol {u}},P)=K\exp(iP)w({\boldsymbol {x}};A)s({\boldsymbol {x}};{\boldsymbol {u}})$

と定義される。ここで $K$ はガウシアンエンベロープの強度のスケール、 $A$ はガウシアンエンベロープのスケールや方向を指定する線形変換、 ${\boldsymbol {u}}$ は搬送波の（空間）周波数と方向を表す波数ベクトル、 $P$ は位相オフセットを表す。 $w$ と $s$ の具体的な形は、1次元の場合、通常スケールパラメータ $\sigma$ により

$w(x;\sigma )=\exp \left(-{\frac {x^{2}}{2\sigma ^{2}}}\right)$ ,

$s(x;u)=\exp \left(2\pi iux\right)$ ,

2次元の場合は、ガウシアンエンベロープの方向 $\theta$ と軸毎のスケールパラメータ $\sigma _{x},\sigma _{y}$ により

 $w(x,y;A(\theta ,\sigma _{x},\sigma _{y}))=\exp \left(-{\frac {x'^{2}+y'^{2}}{2}}\right)$ ,

$s(x,y;{\boldsymbol {u}})=\exp \left(2\pi i(x,y)\cdot {\boldsymbol {u}}\right)$

と表す。ここで $(x',y')=(x,y)A(\theta ,\sigma _{x},\sigma _{y})$ で、

$A(\theta ,\sigma _{x},\sigma _{y})={\begin{pmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \end{pmatrix}}{\begin{pmatrix}{\frac {1}{\sigma _{x}}}&0\\0&{\frac {1}{\sigma _{y}}}\end{pmatrix}}$

である。実部と虚部に分けた形^[4]や、更にガウシアンエンベロープの方向と波数ベクトルの方向とを一致させた形（ ${\boldsymbol {u}}=(f/\sigma _{x})(\cos \theta ,\sin \theta )$ ）も用いられる：

 $\operatorname {Re} g(x,y)=K\exp \left(-{\frac {x'^{2}+y'^{2}}{2}}\right)\cos(2\pi fx'+P)$ ,

 $\operatorname {Im} g(x,y)=K\exp \left(-{\frac {x'^{2}+y'^{2}}{2}}\right)\sin(2\pi fx'+P)$ ,

応用編集

画像処理では通常、異なる周波数、検出方向のガボールフィルタで構成されるフィルタバンクの形でよく利用される（これを画像に畳み込んで得られる出力をガボール空間という）。具体的事例は以下：

虹彩認識^[4]や指紋認証、顔の表情の認識^[5]。

髪型のモデリング^[6]。

脊椎の骨梁（trabecular bone）における方向性の分布の分析^[7]。

哺乳類の脳の一次視覚野にある単純型細胞の活動のモデル^[8]^[9]^[10] 。

スキャンした文書の中のテキスト部分の抽出（一般にテキスト部分が、画像部分に比べ相対的に高周波数成分が多いことを利用している）^[11]^[12]^[13]^[14]。

シーンを記述する低次元の大域特徴^[2]^[15]。

ガボールフィルタはガボールウェーブレットと直接関係しており、ガボールフィルタに回転や膨張を加えることで設計されるが、拡大は非常に時間がかかるため、ガボールウェーブレットでは適用されない^[要出典]。

実装例編集

scikit-imageにおける実装

MATLAB

function gb=gabor_fn(sigma_x,sigma_y,theta,lambda,psi,gamma)

sz_x=fix(6*sigma_x);
if mod(sz_x,2)==0, sz_x=sz_x+1;end
	

sz_y=fix(6*sigma_y);
if mod(sz_y,2)==0, sz_y=sz_y+1;end

[x y]=meshgrid(-fix(sz_x/2):fix(sz_x/2),fix(-sz_y/2):fix(sz_y/2));

% Rotation 
x_theta=x*cos(theta)+y*sin(theta);
y_theta=-x*sin(theta)+y*cos(theta);

gb=exp(-.5*(x_theta.^2/sigma_x^2+gamma^2*y_theta.^2/sigma_y^2)).*cos(2*pi/lambda*x_theta+psi);

脚注編集

^ D. Gabor (1946). “Theory of communications”. J. IEE London 93: 429-457.
^ ^a ^b 原田達也 (2017). 画像認識. 機械学習プロフェッショナルシリーズ. 講談社
^ Movellan, Javier R.. “Tutorial on Gabor Filters”. 2008年5月14日閲覧。
^ ^a ^b ^c Leveseque, Vincent (2000年12月6日). “Texture Segmentation Using Gabor Filters”. 2008年5月14日閲覧。
^ Lyons, M.; Akamatsu, S.; Kamachi, M.; Gyoba, J. (1998). Coding facial expressions with Gabor wavelets. pp. 200–205. doi:10.1109/AFGR.1998.670949. ISBN 0-8186-8344-9. OL OL11390549M. https://zenodo.org/record/3430156
^ Menglei Chai and Lvdi Wang and Yanlin Weng and Yizhou Yu and Baining Guo and Kun Zhou (2012). “Single-view hair modeling for portrait manipulation”. ACM Trans. Graph. 31 (116): 1-8. doi:10.1145/2185520.2185612.
^ C.M.Gdyczynski, etal (2014). “On estimating the directionality distribution in pedicle trabecular bone from micro-CT images”. Journal of Physiological Measurements 35 (12): 2415–2428. Bibcode: 2014PhyM...35.2415G. doi:10.1088/0967-3334/35/12/2415. PMID 25391037.
^ Jones JP, Palmer LA (Dec 1987). “An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex”. J Neurophysiol 58 (6): 1233-1258.
^ Marčelja, S. (1980). “Mathematical description of the responses of simple cortical cells”. Journal of the Optical Society of America 70 (11): 1297–1300. Bibcode: 1980JOSA...70.1297M. doi:10.1364/JOSA.70.001297. PMID 7463179.
^ Daugman, John G. (1985-07-01). “Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters” (英語). Journal of the Optical Society of America A 2 (7): 1160–9. Bibcode: 1985JOSAA...2.1160D. doi:10.1364/JOSAA.2.001160. ISSN 1084-7529. PMID 4020513.
^ Pati, Peeta Basa; Ramakrishnan, A.G. (July 2008). “Word level multi-script identification” (英語). Pattern Recognition Letters 29 (9): 1218–1229. doi:10.1016/j.patrec.2008.01.027. ISSN 0167-8655.
^ Raju S, S.; Pati, P.B.; Ramakrishnan, A.G. (2004). “Gabor filter based block energy analysis for text extraction from digital document images”. First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings. (Palo Alto, CA, USA: IEEE): 233–243. doi:10.1109/DIAL.2004.1263252. ISBN 978-0-7695-2088-9. LCCN 2003-116308. OL OL8067708M.
^ Raju, S. Sabari; Pati, P. B.; Ramakrishnan, A. G. (2005). “Text Localization and Extraction from Complex Color Images”. Lecture Notes in Computer Science 3804: 486–493. doi:10.1007/11595755_59. ISBN 978-3-540-30750-1. ISSN 0302-9743. LCCN 2005-936803. OL OL9056158M.
^ S Sabari Raju, P B Pati and A G Ramakrishnan, “Text Localization and Extraction from Complex Color Images,” Proc. First International Conference on Advances in Visual Computing (ISVC05), Nevada, USA, LNCS 3804, Springer Verlag, Dec. 5-7, 2005, pp. 486-493.
^ Oliva, A., Torralba (2001). “Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope”. International Journal of Computer Vision 42: 145-175. doi:10.1023/A:1011139631724.

外部リンク編集

初期視覚機能をモデル化したガボールフィルタLSI 九州工業大学大学院生命体工学研究科森江研究室
Web-enabled image processing operators 2次元のガボールフィルタを画像に適用した結果を見ることができる。

この項目は、数学に関連した書きかけの項目です。この項目を加筆・訂正などしてくださる協力者を求めています（プロジェクト:数学／Portal:数学）。

[1] D. Gabor (1946). “Theory of communications”. J. IEE London 93: 429-457.

[hara2017-2] 原田達也 (2017). 画像認識. 機械学習プロフェッショナルシリーズ. 講談社

[3] Movellan, Javier R.. “Tutorial on Gabor Filters”. 2008年5月14日閲覧。

[VL00-4] Leveseque, Vincent (2000年12月6日). “Texture Segmentation Using Gabor Filters”. 2008年5月14日閲覧。

[5] Lyons, M.; Akamatsu, S.; Kamachi, M.; Gyoba, J. (1998). Coding facial expressions with Gabor wavelets. pp. 200–205. doi:10.1109/AFGR.1998.670949. ISBN 0-8186-8344-9. OL OL11390549M. https://zenodo.org/record/3430156

[Chai2012SingleviewHM-6] Menglei Chai and Lvdi Wang and Yanlin Weng and Yizhou Yu and Baining Guo and Kun Zhou (2012). “Single-view hair modeling for portrait manipulation”. ACM Trans. Graph. 31 (116): 1-8. doi:10.1145/2185520.2185612.

[7] C.M.Gdyczynski, etal (2014). “On estimating the directionality distribution in pedicle trabecular bone from micro-CT images”. Journal of Physiological Measurements 35 (12): 2415–2428. Bibcode: 2014PhyM...35.2415G. doi:10.1088/0967-3334/35/12/2415. PMID 25391037.

[8] Jones JP, Palmer LA (Dec 1987). “An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex”. J Neurophysiol 58 (6): 1233-1258.

[9] Marčelja, S. (1980). “Mathematical description of the responses of simple cortical cells”. Journal of the Optical Society of America 70 (11): 1297–1300. Bibcode: 1980JOSA...70.1297M. doi:10.1364/JOSA.70.001297. PMID 7463179.

[10] Daugman, John G. (1985-07-01). “Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters” (英語). Journal of the Optical Society of America A 2 (7): 1160–9. Bibcode: 1985JOSAA...2.1160D. doi:10.1364/JOSAA.2.001160. ISSN 1084-7529. PMID 4020513.

[11] Pati, Peeta Basa; Ramakrishnan, A.G. (July 2008). “Word level multi-script identification” (英語). Pattern Recognition Letters 29 (9): 1218–1229. doi:10.1016/j.patrec.2008.01.027. ISSN 0167-8655.

[12] Raju S, S.; Pati, P.B.; Ramakrishnan, A.G. (2004). “Gabor filter based block energy analysis for text extraction from digital document images”. First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings. (Palo Alto, CA, USA: IEEE): 233–243. doi:10.1109/DIAL.2004.1263252. ISBN 978-0-7695-2088-9. LCCN 2003-116308. OL OL8067708M.

[13] Raju, S. Sabari; Pati, P. B.; Ramakrishnan, A. G. (2005). “Text Localization and Extraction from Complex Color Images”. Lecture Notes in Computer Science 3804: 486–493. doi:10.1007/11595755_59. ISBN 978-3-540-30750-1. ISSN 0302-9743. LCCN 2005-936803. OL OL9056158M.

[14] S Sabari Raju, P B Pati and A G Ramakrishnan, “Text Localization and Extraction from Complex Color Images,” Proc. First International Conference on Advances in Visual Computing (ISVC05), Nevada, USA, LNCS 3804, Springer Verlag, Dec. 5-7, 2005, pp. 486-493.

[15] Oliva, A., Torralba (2001). “Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope”. International Journal of Computer Vision 42: 145-175. doi:10.1023/A:1011139631724.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

ガボールフィルタ

目次

定義編集

応用編集

実装例編集

脚注編集

関連項目編集

外部リンク編集

ガボールフィルタ

定義 編集

応用 編集

実装例 編集

脚注 編集

関連項目 編集

外部リンク 編集

定義編集

応用編集

実装例編集

脚注編集

関連項目編集

外部リンク編集