Mainstream Algorithms for AI Matting

Published: 2022-04-13 14:43

AI Matting

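For an image $I$, the portrait region we are interested in is called the foreground $F$, and everything else is the background $B$. The image can then be viewed as a weighted blend of the two, $I = \alpha F + (1 - \alpha) B$, and the matting task is to find the appropriate weight $\alpha$. It is worth noting that, as the matting ground truth shows, $\alpha$ is a continuous value in $[0, 1]$ that can be read as the probability that a pixel belongs to the foreground. This is different from portrait segmentation, where $\alpha$ can only take the value 0 or 1. Segmentation is essentially a classification task, whereas matting is a regression task.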
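The blending equation can be sketched in a few lines of NumPy. This is only an illustration on a toy 1x3 image; the function name `composite` and the 0.5 threshold used to mimic a segmentation mask are assumptions made for the example, not something from the article.

```python
import numpy as np

def composite(alpha, fg, bg):
    """Blend fg over bg per pixel: I = alpha * F + (1 - alpha) * B."""
    a = alpha[..., None]  # broadcast the (H, W) matte over the colour channels
    return a * fg + (1.0 - a) * bg

# Toy 1x3 image: pure background, a half-transparent edge pixel, pure foreground.
alpha = np.array([[0.0, 0.5, 1.0]])
fg = np.full((1, 3, 3), 200.0)  # uniform light foreground (hypothetical values)
bg = np.full((1, 3, 3), 100.0)  # uniform darker background (hypothetical values)

image = composite(alpha, fg, bg)

# A segmentation mask is the same matte forced to {0, 1}; the soft edge is lost.
hard_mask = (alpha >= 0.5).astype(np.float64)
segmented = composite(hard_mask, fg, bg)
```

With a continuous matte the edge pixel is a mix of both layers, while the thresholded mask assigns it entirely to the foreground. That is the practical difference between regression-style matting and classification-style segmentation.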

Matting ground truth:

Segmentation ground truth:

Related Work

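We focus on the more representative deep-learning-based matting algorithms. Today's popular matting algorithms fall roughly into two categories. The first is Trimap-based methods, which require prior information. In the broad sense, this prior can be a Trimap, a coarse mask, a background image without the person, pose information, and so on; the network predicts alpha from the prior together with the image. The second is Trimap-free methods, which predict alpha from the image alone. They are friendlier to real-world applications, but their results are generally worse than those of Trimap-based methods.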

Trimap-based

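The Trimap is the most commonly used prior. As the name suggests, it is a three-valued map: each pixel takes one of the values {0, 128, 255}, which conventionally stand for background, unknown, and foreground respectively.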
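To make the three values concrete, here is a minimal sketch of how a Trimap might be derived from a ground-truth alpha matte. It assumes the common convention of 255 for foreground, 0 for background, and 128 for unknown; the helper names `grow` and `alpha_to_trimap` and the `radius` parameter are hypothetical, and real pipelines usually rely on morphological operations from an image library rather than this pure-NumPy dilation.

```python
import numpy as np

def grow(mask, radius):
    """Dilate a boolean mask with a (2*radius+1)^2 square window (pure NumPy)."""
    h, w = mask.shape
    padded = np.pad(mask, radius, mode="constant", constant_values=False)
    out = np.zeros_like(mask)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out |= padded[dy:dy + h, dx:dx + w]
    return out

def alpha_to_trimap(alpha, radius=1):
    """Build a trimap: 255 = foreground, 0 = background, 128 = unknown."""
    # Pixels with fractional alpha are uncertain; widen that band by `radius`.
    unknown = grow((alpha > 0) & (alpha < 1), radius)
    trimap = np.where(alpha >= 1, 255, 0).astype(np.uint8)
    trimap[unknown] = 128
    return trimap
```

Widening the unknown band is a design choice: a wider band gives the matting network more room to correct edges, at the cost of providing less prior information.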

Deep Image Matting

Most matting algorithms use a Trimap as the prior. In 2017 Adobe proposed Deep Image Matting[^1], the first algorithm to predict alpha end to end. The model has two parts: a matting encoder-decoder stage and a matting refinement stage. The encoder-decoder stage comes first and produces a fairly coarse alpha matte from the input image and its corresponding Trimap. The refinement stage is a small convolutional network that improves the accuracy and edge quality of the alpha matte.

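The paper was state-of-the-art at the time, and many later papers adopted the same "coarse-to-fine" approach to matting. In addition, because annotation is expensive, data for matting had previously been very limited. The paper also introduced a large synthetic dataset, Composition-1K, built by compositing finely annotated foregrounds onto different backgrounds. This yielded 49,300 training images and 1,000 test images and greatly enriched the data available for matting.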

Background Matting

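Background Matting[^2] is a matting algorithm proposed by the University of Washington, later followed by Background Matting V2. The method is fairly novel and has achieved good results in practical engineering applications.

Because the Adobe data is entirely synthetic, the paper also proposes training a self-supervised network $G_{Real}$ on unlabeled real inputs so that the model adapts better to real images. $G_{Real}$ takes the same input as $G_{Adobe}$, the network trained on the Adobe data. The alpha matte and $F$ output by $G_{Adobe}$ supervise the output of $G_{Real}$, which gives the first loss. In addition, the RGB image composited from the output of $G_{Real}$ is passed through a discriminator that judges whether it is real or fake, which gives a second loss. The two losses together train $G_{Real}$.

The paper shows some test results captured with a mobile phone, and the results are quite good in most cases.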

Background Matting V2

Background Matting achieved good results, but it cannot run in real time and does not handle high-resolution input well. The team therefore released Background Matting V2[^3], which produces good results on 4K input at 30 fps.

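A key idea behind the paper's efficient high-resolution matting is that most pixels in an alpha matte are 0 or 1, and only a small number of regions contain transitional pixels. The paper therefore splits the network into a base network and a refine network. The base network processes a low-resolution image, and the refine network uses the base network's output to select specific patches of the original high-resolution image for further processing.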

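The base network takes the image and the background downsampled by a factor of $c$ and, through an encoder-decoder, outputs a coarse alpha matte, $F$, an error map, and hidden features. The error map $E_c$ obtained at the downsampled resolution is upsampled to $\frac{1}{4}$ of the original resolution to give $E_4$, so each pixel of $E_4$ corresponds to a 4x4 block of the original image. Selecting the top-k error pixels from $E_4$ therefore identifies the top-k highest-error 4x4 blocks of the original image. Several 8x8 patches are cropped around the selected pixels and fed to the refine network. The refine network has two stages: the input first passes through some CBR (convolution, batch normalization, ReLU) operations to produce the first-stage output, which is concatenated with the 8x8 patches extracted from the original input and fed to the second stage. Finally, the refined patches replace the corresponding regions of the base result to give the final alpha matte and $F$.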
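The patch-selection step can be illustrated with a small NumPy sketch. It only covers choosing the top-k error locations in $E_4$ and mapping them to 8x8 crop boxes at full resolution; the function name `select_refine_patches`, the box format, and the centring and clipping strategy are assumptions made for illustration and are not taken from the paper's implementation.

```python
import numpy as np

def select_refine_patches(error_quarter, k, patch=8):
    """Pick the k highest-error pixels of E_4 and return full-resolution crop boxes.

    error_quarter: (H/4, W/4) error map; each pixel covers a 4x4 block of the original.
    Returns a list of (top, left, bottom, right) boxes of size patch x patch,
    centred on each selected 4x4 block and shifted to stay inside the image.
    """
    hq, wq = error_quarter.shape
    full_h, full_w = hq * 4, wq * 4
    flat_idx = np.argsort(error_quarter, axis=None)[::-1][:k]  # top-k by error
    ys, xs = np.unravel_index(flat_idx, error_quarter.shape)
    margin = (patch - 4) // 2  # extra context on each side of the 4x4 block
    boxes = []
    for y, x in zip(ys, xs):
        top = int(np.clip(y * 4 - margin, 0, full_h - patch))
        left = int(np.clip(x * 4 - margin, 0, full_w - patch))
        boxes.append((top, left, top + patch, left + patch))
    return boxes
```

Because only k patches are refined, the cost of the refinement stage depends on k rather than on the full image resolution, which is what makes high-resolution input tractable.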

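The paper also released two datasets: the video matting dataset VideoMatte240K and the image matting dataset PhotoMatte13K/85. VideoMatte240K collects 484 high-resolution videos and uses chroma-key software to generate more than 240,000 pairs of foregrounds and alpha mattes. PhotoMatte13K/85 consists of photos taken under good lighting, from which more than 13,000 foreground and alpha matte pairs were obtained using software and manual adjustment. These large datasets are likewise one of the paper's important contributions.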

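Other papers, such as Inductive Guided Filter[^4] and MGMatting[^5], use a coarse mask as the prior for predicting the alpha matte, which is much friendlier than a trimap in practice. MGMatting also introduced RealWorldPortrait-636, a matting dataset of 636 precisely annotated portraits, which can be extended through compositing and other data augmentation methods.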

Trimap-free

In real applications, obtaining prior information is very inconvenient, so some papers move the acquisition of the prior into the network as well.

Semantic Human Matting

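Semantic Human Matting[^6], proposed by Alibaba, also decomposes the matting task. The network has three parts. T-Net classifies each pixel into three classes to produce a Trimap, which is concatenated with the image to form a six-channel input for M-Net. M-Net uses an encoder-decoder to produce a fairly coarse alpha matte. Finally, the outputs of T-Net and M-Net are fed into the Fusion Module, which produces a more accurate alpha matte.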

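During training, the prediction loss $L_p$ consists of an alpha loss $L_\alpha$ and a compositional loss $L_c$, similar to DIM. A pixel classification loss $L_t$ is added on top, so the final loss is $L = L_p + L_t = L_\alpha + L_c + L_t$. The paper achieves an end-to-end Trimap-free matting algorithm, but the model is rather bloated. The paper also introduces the Fashion Model dataset, more than 35,000 annotated images collected from e-commerce websites, but it has not been released.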
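As a rough illustration of how the three terms combine, the sketch below computes $L_\alpha + L_c + L_t$ with NumPy. It uses the unweighted sum given in the text; the smoothed absolute difference, the constant `EPS`, the function names, and the label encoding of the three Trimap classes as {0, 1, 2} are all assumptions for the example rather than details confirmed by the article.

```python
import numpy as np

EPS = 1e-6  # small constant keeping sqrt and log well-behaved (assumed value)

def alpha_loss(alpha_pred, alpha_gt):
    """Smoothed absolute difference between predicted and ground-truth alpha."""
    return float(np.mean(np.sqrt((alpha_pred - alpha_gt) ** 2 + EPS ** 2)))

def compositional_loss(alpha_pred, alpha_gt, fg, bg):
    """Difference between the image composited with alpha_pred and with alpha_gt."""
    comp_pred = alpha_pred[..., None] * fg + (1 - alpha_pred[..., None]) * bg
    comp_gt = alpha_gt[..., None] * fg + (1 - alpha_gt[..., None]) * bg
    return float(np.mean(np.sqrt((comp_pred - comp_gt) ** 2 + EPS ** 2)))

def trimap_classification_loss(probs, labels):
    """Cross-entropy over three classes; probs is (H, W, 3), labels is (H, W) in {0, 1, 2}."""
    h, w = labels.shape
    picked = probs[np.arange(h)[:, None], np.arange(w)[None, :], labels]
    return float(np.mean(-np.log(picked + EPS)))

def total_loss(alpha_pred, alpha_gt, fg, bg, probs, labels):
    """L = L_alpha + L_c + L_t, following the unweighted form in the text."""
    l_alpha = alpha_loss(alpha_pred, alpha_gt)
    l_comp = compositional_loss(alpha_pred, alpha_gt, fg, bg)
    l_tri = trimap_classification_loss(probs, labels)
    return l_alpha + l_comp + l_tri
```

In practice the terms are often given different weights; the plain sum here simply mirrors the formula as written.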

MODNet

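MODNet[^7] argues that neural networks are better at learning a single task, so it splits matting into three sub-tasks, each trained with explicit supervision and optimized simultaneously. It reaches state-of-the-art results at 63 fps on 512x512 input, which is why I also chose MODNet as the baseline in the subsequent engineering implementation.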

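The three sub-tasks are Semantic Estimation, Detail Prediction, and Semantic-Detail Fusion.

The Semantic Estimation part (the S branch) consists of a backbone and a decoder. It outputs semantics downsampled 16 times relative to the input, which provide the semantic information. The ground truth for this task is the annotated alpha after downsampling and Gaussian filtering.

The Detail Prediction task (the D branch) has three inputs: the original image, intermediate features from the semantic branch, and the S branch output $s_p$. The D branch is also an encoder-decoder. Its loss deserves attention: because the D branch only cares about detail features, a trimap is generated from the ground-truth alpha, and the $L_1$ loss between $d_p$ and $\alpha_g$ is computed only in the trimap's unknown region.

The F branch fuses the semantic information with the detail prediction to produce the final alpha matte, which is compared with the ground truth using an $L_1$ loss. The total training loss is $L = \lambda_s L_s + \lambda_d L_d + \lambda_{\alpha} L_{\alpha}$.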
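The masked detail loss and the weighted total can be sketched as follows. This is a simplified NumPy illustration, not MODNet's training code: the function names are hypothetical, the unknown region is assumed to be marked with the value 128, and the default weights of 1.0 are placeholders rather than the values used in the paper.

```python
import numpy as np

def detail_loss(detail_pred, alpha_gt, trimap):
    """L1 loss between d_p and alpha_g restricted to the trimap's unknown region."""
    unknown = trimap == 128  # assumed encoding of the unknown region
    if not unknown.any():
        return 0.0
    return float(np.abs(detail_pred - alpha_gt)[unknown].mean())

def total_loss(l_s, l_d, l_alpha, lam_s=1.0, lam_d=1.0, lam_alpha=1.0):
    """L = lambda_s * L_s + lambda_d * L_d + lambda_alpha * L_alpha (placeholder weights)."""
    return lam_s * l_s + lam_d * l_d + lam_alpha * l_alpha
```

Masking the loss means that errors the detail branch makes in solid foreground or background regions are not penalized, so the branch is free to specialize in the transition band.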

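Finally, the paper proposes OFD, a post-processing method that makes video results smoother over time. When the previous and next frames are similar to each other but the middle frame is far from both, the middle frame is smoothed with the average of its two neighbours. The drawback is that the actual output lags the input by one frame.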
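The smoothing rule described above might look like the following NumPy sketch. It applies the rule per pixel and uses a hypothetical `threshold` parameter to decide what counts as "similar"; both choices are assumptions for illustration, not details given in the article.

```python
import numpy as np

def ofd_smooth(prev_alpha, cur_alpha, next_alpha, threshold=0.1):
    """Replace flickering pixels of the middle frame by the mean of its neighbours."""
    # Previous and next frames agree with each other at this pixel.
    neighbours_agree = np.abs(prev_alpha - next_alpha) <= threshold
    # The middle frame is far from both of its neighbours.
    middle_differs = (np.abs(cur_alpha - prev_alpha) > threshold) & \
                     (np.abs(cur_alpha - next_alpha) > threshold)
    flicker = neighbours_agree & middle_differs
    return np.where(flicker, (prev_alpha + next_alpha) / 2.0, cur_alpha)
```

Since the function needs the next frame before it can output the current one, a frame can only be emitted after its successor has arrived, which is the one-frame delay mentioned in the text.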

In addition, networks such as $U^2$-Net and SIM can perform saliency-based matting on images.

