detectron2系列：config软件包

2020-09-04 09:48

磐创AI

关注

配置参考

＃－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－

＃ Convention about Training ／ Test specific parameters

＃ Whenever an argument can be either used for training or for testing， the

＃ corresponding name will be post－fixed by a ＿TRAIN for a training parameter，

＃ or ＿TEST for a test－specific parameter．

＃ For example， the number of images during training will be

＃ IMAGES＿PER＿BATCH＿TRAIN， while the number of images for testing will be

＃ IMAGES＿PER＿BATCH＿TEST

＃ Config definition

＿C ＝ CN（）

＃ The version number， to upgrade from old configs to new ones if any

＃ changes happen． It＇s recommended to keep a VERSION in your config file．

＿C．VERSION ＝ 2

＿C．MODEL ＝ CN（）

＿C．MODEL．LOAD＿PROPOSALS ＝ False

＿C．MODEL．MASK＿ON ＝ False

＿C．MODEL．KEYPOINT＿ON ＝ False

＿C．MODEL．DEVICE ＝＂cuda＂

＿C．MODEL．META＿ARCHITECTURE ＝＂GeneralizedRCNN＂

＃ Path （possibly with schema like catalog：／／ or detectron2：／／） to a checkpoint file

＃ to be loaded to the model． You can find available models in the model zoo．

＿C．MODEL．WEIGHTS ＝＂＂

＃ Values to be used for image normalization （BGR order， since INPUT．FORMAT defaults to BGR）．

＃ To train on images of different number of channels， just set different mean ＆ std．

＃ Default values are the mean pixel value from ImageNet：［103．53， 116．28， 123．675］

＿C．MODEL．PIXEL＿MEAN ＝［103．530， 116．280， 123．675］

＃ When using pre－trained models in Detectron1 or any MSRA models，

＃ std has been absorbed into its conv1 weights， so the std needs to be set 1．

＃ Otherwise， you can use ［57．375， 57．120， 58．395］（ImageNet std）

＿C．MODEL．PIXEL＿STD ＝［1．0， 1．0， 1．0］

＃ INPUT

＿C．INPUT ＝ CN（）

＃ Size of the smallest side of the image during training

＿C．INPUT．MIN＿SIZE＿TRAIN ＝（800，）

＃ Sample size of smallest side by choice or random selection from range give by

＃ INPUT．MIN＿SIZE＿TRAIN

＿C．INPUT．MIN＿SIZE＿TRAIN＿SAMPLING ＝＂choice＂

＃ Maximum size of the side of the image during training

＿C．INPUT．MAX＿SIZE＿TRAIN ＝ 1333

＃ Size of the smallest side of the image during testing． Set to zero to disable resize in testing．

＿C．INPUT．MIN＿SIZE＿TEST ＝ 800

＃ Maximum size of the side of the image during testing

＿C．INPUT．MAX＿SIZE＿TEST ＝ 1333

＃｀True｀ if cropping is used for data augmentation during training

＿C．INPUT．CROP ＝ CN（｛＂ENABLED＂： False｝）

＃ Cropping type：

＃－＂relative＂ crop （H ＊ CROP．SIZE［0］， W ＊ CROP．SIZE［1］） part of an input of size （H， W）

＃－＂relative＿range＂ uniformly sample relative crop size from between ［CROP．SIZE［0］，［CROP．SIZE［1］］．

＃ and ［1， 1］ and use it as in ＂relative＂ scenario．

＃－＂absolute＂ crop part of an input with absolute size：（CROP．SIZE［0］， CROP．SIZE［1］）．

＿C．INPUT．CROP．TYPE ＝＂relative＿range＂

＃ Size of crop in range （0， 1］ if CROP．TYPE is ＂relative＂ or ＂relative＿range＂ and in number of

＃ pixels if CROP．TYPE is ＂absolute＂

＿C．INPUT．CROP．SIZE ＝［0．9， 0．9］

＃ Whether the model needs RGB， YUV， HSV etc．

＃ Should be one of the modes defined here， as we use PIL to read the image：

＃ https：／／pillow．readthedocs．io／en／stable／handbook／concepts．html＃concept－modes

＃ with BGR being the one exception． One can set image format to BGR， we will

＃ internally use RGB for conversion and flip the channels over

＿C．INPUT．FORMAT ＝＂BGR＂

＃ The ground truth mask format that the model will use．

＃ Mask R－CNN supports either ＂polygon＂ or ＂bitmask＂ as ground truth．

＿C．INPUT．MASK＿FORMAT ＝＂polygon＂＃ alternative：＂bitmask＂

＃ Dataset

＿C．DATASETS ＝ CN（）

＃ List of the dataset names for training． Must be registered in DatasetCatalog

＿C．DATASETS．TRAIN ＝（）

＃ List of the pre－computed proposal files for training， which must be consistent

＃ with datasets listed in DATASETS．TRAIN．

＿C．DATASETS．PROPOSAL＿FILES＿TRAIN ＝（）

＃ Number of top scoring precomputed proposals to keep for training

＿C．DATASETS．PRECOMPUTED＿PROPOSAL＿TOPK＿TRAIN ＝ 2000

＃ List of the dataset names for testing． Must be registered in DatasetCatalog

＿C．DATASETS．TEST ＝（）

＃ List of the pre－computed proposal files for test， which must be consistent

＃ with datasets listed in DATASETS．TEST．

＿C．DATASETS．PROPOSAL＿FILES＿TEST ＝（）

＃ Number of top scoring precomputed proposals to keep for test

＿C．DATASETS．PRECOMPUTED＿PROPOSAL＿TOPK＿TEST ＝ 1000

＃ DataLoader

＿C．DATALOADER ＝ CN（）

＃ Number of data loading threads

＿C．DATALOADER．NUM＿WORKERS ＝ 4

＃ If True， each batch should contain only images for which the aspect ratio

＃ is compatible． This groups portrait images together， and landscape images

＃ are not batched with portrait images．

＿C．DATALOADER．ASPECT＿RATIO＿GROUPING ＝ True

＃ Options： TrainingSampler， RepeatFactorTrainingSampler

＿C．DATALOADER．SAMPLER＿TRAIN ＝＂TrainingSampler＂

＃ Repeat threshold for RepeatFactorTrainingSampler

＿C．DATALOADER．REPEAT＿THRESHOLD ＝ 0．0

＃ if True， the dataloader will filter out images that have no associated

＃ annotations at train time．

＿C．DATALOADER．FILTER＿EMPTY＿ANNOTATIONS ＝ True

＃－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－－＃

＃ Backbone options

＿C．MODEL．BACKBONE ＝ CN（）

＿C．MODEL．BACKBONE．NAME ＝＂build＿resnet＿backbone＂

＃ Freeze the first several stages so they are not trained．

＃ There are 5 stages in ResNet． The first is a convolution， and the following

＃ stages are each group of residual blocks．

＿C．MODEL．BACKBONE．FREEZE＿AT ＝ 2

＃ FPN options

＿C．MODEL．FPN ＝ CN（）

＃ Names of the input feature maps to be used by FPN

＃ They must have contiguous power of 2 strides

＃ e．g．，［＂res2＂，＂res3＂，＂res4＂，＂res5＂］

＿C．MODEL．FPN．IN＿FEATURES ＝［］

＿C．MODEL．FPN．OUT＿CHANNELS ＝ 256

＃ Options：＂＂（no norm），＂GN＂

＿C．MODEL．FPN．NORM ＝＂＂

＃ Types for fusing the FPN top－down and lateral features． Can be either ＂sum＂ or ＂avg＂

＿C．MODEL．FPN．FUSE＿TYPE ＝＂sum＂

＃ Proposal generator options

＿C．MODEL．PROPOSAL＿GENERATOR ＝ CN（）

＃ Current proposal generators include ＂RPN＂，＂RRPN＂ and ＂PrecomputedProposals＂

＿C．MODEL．PROPOSAL＿GENERATOR．NAME ＝＂RPN＂

＃ Proposal height and width both need to be greater than MIN＿SIZE

＃（a the scale used during training or inference）

＿C．MODEL．PROPOSAL＿GENERATOR．MIN＿SIZE ＝ 0

＃ Anchor generator options

＿C．MODEL．ANCHOR＿GENERATOR ＝ CN（）

＃ The generator can be any name in the ANCHOR＿GENERATOR registry

＿C．MODEL．ANCHOR＿GENERATOR．NAME ＝＂DefaultAnchorGenerator＂

＃ Anchor sizes （i．e． sqrt of area） in absolute pixels w．r．t． the network input．

＃ Format： list［list［int］］． SIZES［i］ specifies the list of sizes

＃ to use for IN＿FEATURES［i］； len（SIZES）＝＝ len（IN＿FEATURES） must be true，

＃ or len（SIZES）＝＝ 1 is true and size list SIZES［0］ is used for all

＃ IN＿FEATURES．

＿C．MODEL．ANCHOR＿GENERATOR．SIZES ＝［［32， 64， 128， 256， 512］］

＃ Anchor aspect ratios． For each area given in ｀SIZES｀， anchors with different aspect

＃ ratios are generated by an anchor generator．

＃ Format： list［list［int］］． ASPECT＿RATIOS［i］ specifies the list of aspect ratios

＃ to use for IN＿FEATURES［i］； len（ASPECT＿RATIOS）＝＝ len（IN＿FEATURES） must be true，

＃ or len（ASPECT＿RATIOS）＝＝ 1 is true and aspect ratio list ASPECT＿RATIOS［0］ is used

＃ for all IN＿FEATURES．

＿C．MODEL．ANCHOR＿GENERATOR．ASPECT＿RATIOS ＝［［0．5， 1．0， 2．0］］

＃ Anchor angles．

＃ list［float］， the angle in degrees， for each input feature map．

＃ ANGLES［i］ specifies the list of angles for IN＿FEATURES［i］．

＿C．MODEL．ANCHOR＿GENERATOR．ANGLES ＝［［－90， 0， 90］］

＃ Relative offset between the center of the first anchor and the top－left corner of the image

＃ Units： fraction of feature map stride （e．g．， 0．5 means half stride）

＃ Allowed values are floats in ［0， 1） range inclusive．

＃ Recommended value is 0．5， although it is not expected to affect model accuracy．

＿C．MODEL．ANCHOR＿GENERATOR．OFFSET ＝ 0．0

＃ RPN options

＿C．MODEL．RPN ＝ CN（）

＿C．MODEL．RPN．HEAD＿NAME ＝＂StandardRPNHead＂＃ used by RPN＿HEAD＿REGISTRY

＃ Names of the input feature maps to be used by RPN

＃ e．g．，［＂p2＂，＂p3＂，＂p4＂，＂p5＂，＂p6＂］ for FPN

＿C．MODEL．RPN．IN＿FEATURES ＝［＂res4＂］

＃ Remove RPN anchors that go outside the image by BOUNDARY＿THRESH pixels

＃ Set to －1 or a large value， e．g． 100000， to disable pruning anchors

＿C．MODEL．RPN．BOUNDARY＿THRESH ＝－1

＃ IOU overlap ratios ［BG＿IOU＿THRESHOLD， FG＿IOU＿THRESHOLD］

＃ Minimum overlap required between an anchor and ground－truth box for the

＃（anchor， gt box） pair to be a positive example （IoU ＞＝ FG＿IOU＿THRESHOLD

＃＝＝＞ positive RPN example： 1）

＃ Maximum overlap allowed between an anchor and ground－truth box for the

＃（anchor， gt box） pair to be a negative examples （IoU ＜ BG＿IOU＿THRESHOLD

＃＝＝＞ negative RPN example： 0）

＃ Anchors with overlap in between （BG＿IOU＿THRESHOLD ＜＝ IoU ＜ FG＿IOU＿THRESHOLD）

＃ are ignored （－1）

＿C．MODEL．RPN．IOU＿THRESHOLDS ＝［0．3， 0．7］

＿C．MODEL．RPN．IOU＿LABELS ＝［0，－1， 1］

＃ Total number of RPN examples per image

＿C．MODEL．RPN．BATCH＿SIZE＿PER＿IMAGE ＝ 256

＃ Target fraction of foreground （positive） examples per RPN minibatch

＿C．MODEL．RPN．POSITIVE＿FRACTION ＝ 0．5

＃ Weights on （dx， dy， dw， dh） for normalizing RPN anchor regression targets

＿C．MODEL．RPN．BBOX＿REG＿WEIGHTS ＝（1．0， 1．0， 1．0， 1．0）

＃ The transition point from L1 to L2 loss． Set to 0．0 to make the loss simply L1．

＿C．MODEL．RPN．SMOOTH＿L1＿BETA ＝ 0．0

＿C．MODEL．RPN．LOSS＿WEIGHT ＝ 1．0

＃ Number of top scoring RPN proposals to keep before applying NMS

＃ When FPN is used， this is ＊per FPN level＊（not total）

＿C．MODEL．RPN．PRE＿NMS＿TOPK＿TRAIN ＝ 12000

＿C．MODEL．RPN．PRE＿NMS＿TOPK＿TEST ＝ 6000

＃ Number of top scoring RPN proposals to keep after applying NMS

＃ When FPN is used， this limit is applied per level and then again to the union

＃ of proposals from all levels

＃ NOTE： When FPN is used， the meaning of this config is different from Detectron1．

＃ It means per－batch topk in Detectron1， but per－image topk here．

＃ See ＂modeling／rpn／rpn＿outputs．py＂ for details．

＿C．MODEL．RPN．POST＿NMS＿TOPK＿TRAIN ＝ 2000

＿C．MODEL．RPN．POST＿NMS＿TOPK＿TEST ＝ 1000

＃ NMS threshold used on RPN proposals

＿C．MODEL．RPN．NMS＿THRESH ＝ 0．7

本地收藏打印推荐给朋友

声明： 本文由入驻维科号的作者撰写，观点仅代表作者本人，不代表OFweek立场。如有侵权或其他问题，请联系举报。

发表评论

共0条评论，0人参与

登录登录即可访问所有OFweek服务

用户名/邮箱/手机：
密码：
忘记密码？
用其他账号登录： QQ | 微信 | 新浪微博

请输入评论内容...

请输入评论/评论长度6~500个字

暂无评论

图片新闻