书签分享收藏举报版权申诉 / 22

立即下载加入VIP,免费下载

当前位置：首页 > 考试认证 > IT认证 > 论文翻译.docx

论文翻译.docx

文档编号：30250508
上传时间：2023-08-13
格式：DOCX
页数：22
大小：39.25KB

论文翻译.docx

《论文翻译.docx》由会员分享，可在线阅读，更多相关《论文翻译.docx（22页珍藏版）》请在冰豆网上搜索。

论文翻译.docx

论文翻译

UnsupervisedObjectDiscoveryandLocalizationintheWild:

Part-basedMatchingwithBottom-upRegionProposals

MinsuCho1,

SuhaKwak1,

CordeliaSchmid1,y

JeanPonce2,

1

Inria

2´

EcoleNormaleSuperieure´/PSLResearchUniversity

野外无监督的目标发现与定位：

一部分基于自下而上的区域匹配

Abstract

Thispaperaddressesunsuperviseddiscoveryandlocalizationofdominantobjectsfromanoisyimagecollectionwithmultipleobjectclasses.Thesettingofthisproblemisfullyunsupervised,withoutevenimagelevelannotationsoranyassumptionofasingledominantclass.Thisisfarmoregeneralthantypicalcolocalization,cosegmentation,orweakly-supervisedlocalizationtasks.Wetacklethediscoveryandlocalizationproblemusingapartbasedregionmatchingapproach:

Weuseoff-the-shelfregionproposalstoformasetofcandidateboundingboxesforobjectsandobjectparts.TheseregionsareefficientlymatchedacrossimagesusingaprobabilisticHoughtransformthatevaluatestheconfidenceforeachcandidatecorrespondenceconsideringbothappearanceandspatialconsistency.Dominantobjectsarediscoveredandlocalizedbycomparingthescoresofcandidateregionsandselectingthosethatstandoutoverotherregionscontainingthem.Extensiveexperimentalevaluationsonstandardbenchmarksdemonstratethattheproposedapproachsignificantlyoutperformsthecurrentstateoftheartincolocalization,andachievesrobustobjectdiscoveryinchallengingmixed-classdatasets.

摘要

本文针对无监督的发现及从多对象类噪声图像采集中对主要目标的定位。

这个问题的设置是完全无监督的，甚至没有图像级的注释或任何一个单一的显性类的假设。

这是远比典型的联合定位，联合分割，弱监督定位任务更普遍。

我们处理的发现和定位问题，采用基于区域匹配方法：

我们使用现成的区域提案，以形成一组候选边界框的对象和对象的部分。

这些区域被有效地匹配在图像使用概率霍夫变换，计算每个候选对应信心考虑外观、空间一致性。

主要目标的发现和定位是通过比较候选区的得分并选择那些站在其他区域以外的候选区。

广泛的实验评估标准的基准测试表明，该方法明显优于现在的联合定位，在具有挑战性的混合类数据实现鲁棒的目标发现。

1.Introduction

Objectlocalizationanddetectionishighlychallengingbecauseofintraclassvariations,backgroundclutter,andocclusionspresentinrealworldimages.Whilesignificantprogresshasbeenmadeinthisareaoverthelastdecade,asshownbyrecentbenchmarkresults[11,16],moststate-of-the-artmethodsstillrelyonstrongsupervisionintheformofmanually-annotatedboundingboxesontargetinstances.Sincethosedetailedannotationsareexpensivetoacquireandalsopronetounwantedbiasesanderrors,recentworkhasexploredtheproblemofweakly-supervisedobjectdiscoverywhereinstancesofanobjectclassarefoundinacollectionofimageswithoutanybox-levelannotations.Typically,weakly-supervisedlocalization[9,35,36,43,45,56]requirespositiveandnegativeimage-levellabelsforatargetobjectclass.Ontheotherhand,cosegmentation[25,29,40]andcolocalization[12,27,51]assumelesssupervisionandonlyrequiretheimagecollectiontocontainasingledominantobjectclass,allowingnoisyimagestosomedegree.

1.简介

目标定位和检测是非常具有挑战性的，因为类内的变化，背景杂乱，和在现实世界中的图像遮挡。

虽然在过去十年中已经取得了重大进展，如最近的基准测试结果[16，11]，大多数国家最先进的方法仍然依赖于强大的监督，在目标的情况下，手动标注边界框的形式。

由于这些详细的注释是需要很大代价，也容易出现不必要的偏差和错误，最近的工作研究了弱监督的目标发现问题，在采集的图像中对象类的实例中发现的图像的集合，而无需任何边框注释。

典型地，弱监督定位[9，35，36，43，45，56]需要正面和负面的目标对象类图像的标签。

另一方面，cosegmentation[25，29，40]和[12，27，51共定位，承担更少的监督和]只需要图像采集包含一个单一的显性对象类，允许一定程度的噪声图像。

Thispaperaddressesunsupervisedobjectlocalizationinafarmoregeneralscenariowhereagivenimagecollectioncontainmultipledominantobjectclassesandevennoisyim-ageswithoutanytargetobjects.AsillustratedinFig.1,thesettingofthisproblemisfullyunsupervised,withoutanyimage-levelannotations,anassumptionofasingledomi-nantclass,orevenaknownnumberofobjectclasses.Inspiteofthisgenerality,theproposedmethodmarkedlyout-performsthestateoftheartsincolocalization[27,51]onstandardbenchmarks[16,40],andcloselycompeteswithcurrentweakly-supervisedlocalization[9,43,56].

本文讨论了无监督的对象定位在一个更一般的场景，一个给定的图像采集包含多个主要的对象类，甚至嘈杂的我年龄没有任何目标对象。

如图1所示，此问题的设置是完全无监督的，没有任何形象层面的注释，一个单一的显性类的一个假设，甚至是一个已知的对象类。

在这种普遍性的怨恨，该方法明显优于共存[27国家的艺术，51]标准的基准测试[16，40]，并密切竞争与电流弱监督定位[9，43，56]。

Weadvocateapart-basedmatchingapproachtounsuper-visedobjectdiscoveryusingbottom-upregionproposals.Multi-scaleregionproposalshavebeenwidelyusedbeforetorestrictthesearchspaceforobjectboundingboxesinob-jectrecognition[9,20,53]andlocalization[9,27,51,54].Wegofurtherandproposeheretousetheseregionstoformasetofcandidateregionsnotonlyforobjects,butalsoforobjectparts.WeuseaprobabilisticHoughtransorm[2]tomatchthosecandidateregionsacrossimages,andassignthemconfidencescoresreflectingbothappearanceandspa-tialconsistency.Thiscanbeseenasanunsupervisedandef-ficientvariantofbothdeformablepartmodels[18,19]andgraphmatchingmethods[5,14].Objectsarediscoveredandlocalizedbyselectingthemostsalientregionsthatcon-taincorrespondingparts.Tothisend,weintroduceascorethatmeasureshowmucharegionstandsoutoverotherre-gionscontainingit.Theproposedalgorithmalternatesbe-tweenpart-basedregionmatchingandforegroundlocaliza-tion,improvingbothoveriterations.

我们提倡一部分以进行修正使用自下而上的区域建议对象发现匹配方法。

多尺度区域的建议已经被广泛使用之前，限制搜索空间的对象包围盒在转播—

对象识别[9，20，53]和[27，9定位，51，54]。

我们进一步提出使用这些区域，以形成一组候选区域，不仅为对象，但也为对象的部分。

我们用一个概率Hough变换[2]相匹配的候选区域的图像，并给他们信心分数反映的外观和空间一致性。

这可以看出，作为一种无监督和EF的变形的零件模型[18的变体，19]和图匹配的方法[5，14]。

对象的发现和选择所包含的相应部分的最显着的地区定位。

为此，我们引入了一个分数，衡量区域站在其他区域包含它。

该算法在基于区域匹配之间的前景和定位，提高迭代。

Themaincontributionsofthispapercanbesummarizedasfollows:

（1）Apart-basedregionmatchingapproachtounsupervisedobjectdiscoveryisintroduced.

（2）Aneffi-cientandrobustmatchingalgorithmbasedonaprobabilis-ticHoughtransformisproposed.（3）Astandoutscoreforrobustforegroundlocalizationisintroduced.（4）Objectdis-coveryandlocalizationinafullyunsupervisedsetupisex-ploredonchallengingbenchmarkdatasets[16,40].

本文的主要贡献概括如下：

（1）介绍了一种基于区域匹配方法的无监督目标发现。

（2）一个高效鲁棒匹配算法基于随机Hough变换算法。

（3）介绍了一种强大的前景定位出色的成绩。

（4）在一个完全无监督的设置对象的发现和定位的探索具有挑战性的基准数据集[16，40]。

2.Relatedwork

Unsupervisedobjectdiscoveryhaslongbeenattemptedincomputervision.Sivicetal.[48]andRusselletal.[42]applystatisticaltopicdiscoverymodels.GraumanandDarrel[21]usepartialcorrespondenceandclusteringoflo-calfeatures.KimandTorralba[28]employalinkanaly-sistechnique.FaktorandIrani[17]proposeclusteringbycomposition.Unsupervisedobjectdiscovery,however,hasprovenextremelydifficult“inthewild”;alloftheseprevi-ousapproacheshavebeensuccessfullydemonstratedinarestrictedsettingwithafewdistinctiveobjectclasses,buttheirlocalizationresultsturnouttobefarbehindweakly-supervisedresultsonchallengingbenchmarks[12,28,51].

2.相关的工作

在计算机视觉中，无监督的对象的发现一直被尝试。

Sivic等人。

[48]和罗素等人。

[42]适用于统计专题发现模型。

洛杉矶和DAR关系[21]利用部分对应和局部特征聚类。

基姆和济南[28]利用链接分析技术。

该和伊朗[17]提出了聚类组成。

无监督的对象发现，然而，已被证明非常困难的“野生”；所有这些已有的方法已被证明成功的限制和一些独特的对象类的设置，但其定位结果将远弱监督结果上具有挑战性的基准[12，28，51]。

Giventhedifficultyoffullyunsuperviseddiscovery,re-centworkhasmorefocusedonweakly-supervisedap-proachesfromdifferentangles.Cosegmentationistheprob-lemofsegmentingcommonforegroundregionsoutofasetofimages.IthasbeenfirstintroducedbyRotheretal.[38]whofuseMarkovrandomfieldswithcolorhis-togrammatchingtosegmentobjectscommontotwoimages.Sincethen,thisapproachhasbeenimprovedinnu-merousways[4,6,23,55],andextendedtohandlemoregeneralcases[7,25,40,54].Giventhesametypeofin-putascosegmentation,colocalizationseekstolocalizeob-jectswithboundingboxesinsteadofpixel-wisesegmen-tations.Tangetal.[51]usethediscriminativecluster-ingframeworkof[25]tolocalizecommonobjectsinasetofnoisyimages,andJoulinetal.[27]extendittocolo-calizationofvideoframes.Weakly-supervisedlocaliza-tion[9,12,35,36,46,49]sharesthesametypeofoutputascolocalization,butassumesamoresupervisedscenariowithimage-levellabelsthatindicatewhetheratargetob-jectclassappearsintheimageornot.Theselabelsenabletolearnmorediscriminativelocalizationmethods,e.g.,byminingnegativeimages[9].Recentworkondiscriminativepatchdiscovery[15,44,50]learnsmid-levelvisualrepre-sentationsinaweakly-supervisedmode,andusethemforobjectrecognition[15,44]anddiscovery[13,50].

给出了完全无监督发现的困难，重新分工作多集中在弱监督方法从不同的角度。

cosegmentation是分割共同的前景区域的一组图像的问题。

它首先介绍了洛特等人的。

【38】融合马尔可夫随机域颜色直方图匹配的两个图像常见的对象分割。

从那时起，这种方法已经在众多的方法[4，6，23，55]改进，并扩展到处理更一般的情况下，[7，25，40，54]。

相同类型的放式cosegmentation寻求定位，定位对象与包围盒代替像素点”。

唐等。

[51]使用[25]判别聚类框架定位在一组图像常见的物体，和Joulin等人。

[27]扩展到彩色视频帧定位。

弱监督定位方法[9，12，35，36，46，49]股输出共存同一类型，但具有更监督场景图像水平的标签表明确定目标对象类出现在图像或不。

这些标签可以学习更多的歧视定位方法，例如，通过挖掘负的图像[9]。

最近的工作在一个弱监督的模式识别碎片发现学习中级的视觉陈述，并使用它们的对象识别和发现

Regionproposalshavebeenusedinmanyofthemethodsdiscussedsofar,butmostofthem[12,27,28,42,51,54]userelativelyasmallnumberofthebestproposals（typ-ically,lessthan100foreachimage）toformwholeob-jecthypotheses,oftentogetherwithgenericobjectnessmea-sures[1].Incontrast,weusealargenumberofregionpro-posals（typically,between1000and4000）asprimitiveel-ementsformatchingwithoutanyobjectnesspriors.Whilemanyotherapproaches[7,40,41]alsousecorrespondencesbetweenimagepairstodiscoverobjectregions,theydonotuseanefficientpart-basedmatchingapproachsuchasours.Manyofthem[7,21,40]aredrivenbycorrespondencetechniques,e.g.,theSIFTflow[32],basedongenericlocalregions.Inthesensethatsemi-localormid-levelpartsarecrucialforrepresentinggenericobjects[18,30],webelievesegment-levelregionsaremoreadequateforobjectmatch-inganddiscovery.TheworkofRubioetal.[41]introducessuchasegment-levelmatchingtermintheircosegmentationformulation.Unlikeours,however,itrequiresareason-ableinitializationb