Code Implementations for Several Recent Notable Papers (with Source Code Downloads)

Official account ID | ComputerVisionGzq

Computer Vision Research Institute column

Author: Edison_G

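This is a new section from the Computer Vision Research Institute. Going forward, we will regularly share code implementations of the latest papers and techniques.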

《Towards Layer-wise Image Vectorization》(CVPR 2022)

GitHub: github.com/ma-xu/LIVE

Installation

We suggest using conda to create a new Python environment.

Requirement: 5.0<GCC<6.0; nvcc >10.0.
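Before installing, it can help to confirm that the local toolchain falls inside the stated range. The following is a minimal sketch, not part of the LIVE repository: the helper names (`parse_gcc_version`, `parse_nvcc_version`, `gcc_ok`, `nvcc_ok`, `tool_output`) are hypothetical, and it assumes `gcc -dumpversion` prints a dotted version and `nvcc --version` prints a `release X.Y` string.

```python
import re
import shutil
import subprocess


def parse_gcc_version(text):
    """Parse `gcc -dumpversion` output such as '5.4.0' into a tuple of ints."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        return None
    return tuple(int(part) for part in match.group(0).split("."))


def parse_nvcc_version(text):
    """Extract (major, minor) from the 'release X.Y' part of `nvcc --version`."""
    match = re.search(r"release (\d+)\.(\d+)", text)
    return (int(match.group(1)), int(match.group(2))) if match else None


def gcc_ok(version):
    """True if 5.0 < GCC < 6.0, the range stated in the requirement."""
    return (5, 0) < version < (6, 0)


def nvcc_ok(version):
    """True if nvcc > 10.0, as stated in the requirement."""
    return version > (10, 0)


def tool_output(command):
    """Run a command and return its stdout, or None if the tool is not on PATH."""
    if shutil.which(command[0]) is None:
        return None
    return subprocess.run(command, capture_output=True, text=True).stdout


if __name__ == "__main__":
    checks = [
        ("gcc", ["gcc", "-dumpversion"], parse_gcc_version, gcc_ok),
        ("nvcc", ["nvcc", "--version"], parse_nvcc_version, nvcc_ok),
    ]
    for name, command, parse, is_ok in checks:
        output = tool_output(command)
        version = parse(output) if output is not None else None
        if version is None:
            print(f"{name}: not found or version not recognized")
        else:
            status = "OK" if is_ok(version) else "outside the required range"
            print(f"{name} {'.'.join(map(str, version))}: {status}")
```

Newer GCC releases may print only the major version for `-dumpversion`; since any such release is already outside the 5.x range, the check still reports it correctly.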

git clone git@github.com:ma-xu/LIVE.git
cd LIVE
conda create -n live python=3.7
conda activate live
conda install -y pytorch torchvision -c pytorch
conda install -y numpy scikit-image
conda install -y -c anaconda cmake
conda install -y -c conda-forge ffmpeg
pip install svgwrite svgpathtools cssutils numba torch-tools scikit-fmm easydict visdom
pip install opencv-python==4.5.4.60  # please install this version to avoid segmentation fault.
cd DiffVG
git submodule update --init --recursive
python setup.py install
cd ..

Run Experiments

conda activate live
cd LIVE
# Please modify the parameters accordingly.
python main.py --config <config.yaml> --experiment <experiment-setting> --signature <given-folder-name> --target <input-image> --log_dir <log-dir>
# Here is a simple example:
python main.py --config config/base.yaml --experiment experiment_5x1 --signature smile --target figures/smile.png --log_dir log/

《Multimodal Token Fusion for Vision Transformers》(CVPR 2022)

GitHub: github.com/yikaiw/TokenFusion

《PointAugmenting: Cross-Modal Augmentation for 3D Object Detection》(CVPR 2022)

GitHub: github.com/VISION-SJTU/PointAugmenting

《Fantastic questions and where to find them: FairytaleQA -- An authentic dataset for narrative comprehension.》(ACL 2022)

GitHub: github.com/uci-soe/FairytaleQAData

《LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks》(AAAI 2022)

GitHub: github.com/agoodge/LUNAR

First, extract data.zip.

To replicate the results on the HRSS dataset with neighbour count k = 100 and the "Mixed" negative sampling scheme:

Extract saved_models.zip
Run:

python3 main.py --dataset HRSS --samples MIXED --k 100

To train a new model:

python3 main.py --dataset HRSS --samples MIXED --k 100 --train_new_model
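For readers unfamiliar with what a neighbour count controls, here is a rough illustration using the classic k-nearest-neighbour distance score from local outlier detection in general. It is not LUNAR's method (which, per the paper title, is built on graph neural networks) and does not use the repository's code; the function name `knn_distance_scores` and the toy points are assumptions made for this sketch.

```python
import math


def knn_distance_scores(points, k):
    """Score each point by its mean distance to its k nearest other points.

    A larger score means the point sits farther from its neighbours,
    which is the usual intuition behind distance-based outlier scores.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must be between 1 and len(points) - 1")
    scores = []
    for i, point in enumerate(points):
        # Distances from this point to every other point, nearest first.
        distances = sorted(
            math.dist(point, other)
            for j, other in enumerate(points)
            if j != i
        )
        scores.append(sum(distances[:k]) / k)
    return scores


# Toy data: four points on a unit square plus one point far away.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (10.0, 10.0)]
scores = knn_distance_scores(points, k=2)
for point, score in zip(points, scores):
    print(point, round(score, 3))
```

With k = 2, the four square corners each score 1.0, while the distant point scores far higher, so it would be ranked as the most anomalous.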

《Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music》(ICASSP 2022)

GitHub: github.com/keums/icassp2022-vocal-transcription

《Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion》(ICASSP 2022)

GitHub: github.com/jlian2/Robust-Voice-Style-Transfer

Demo: https://jlian2.github.io/Robust-Voice-Style-Transfer/

《HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers》(ICRA 2022)

GitHub: github.com/NVlabs/handover-sim

2022-06-03 16:13:46: Running evaluation for results/2022-02-28_08-57-34_yang-icra2021_s0_test
2022-06-03 16:13:47: Evaluation results:
| success rate | mean accum time (s) | failure (%) |
| (%) | exec | plan | total | hand contact | object drop | timeout |
|:---------------:|:------:|:------:|:-------:|:---------------:|:---------------:|:--------------:|
| 64.58 ( 93/144) | 4.864 | 0.036 | 4.900 | 17.36 ( 25/144) | 11.81 ( 17/144) | 6.25 ( 9/144) |
2022-06-03 16:13:47: Printing scene ids
2022-06-03 16:13:47: Success (93 scenes):
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
0 1 2 3 4 5 6 7 8 9 10 12 13 15 16 17 18 19 21 22
23 25 26 27 28 30 33 34 35 36 37 38 42 43 46 49 50 53 54 56
59 60 62 63 64 66 68 69 70 71 72 77 81 83 85 87 89 91 92 93
94 95 96 98 103 106 107 108 109 110 111 112 113 114 115 116 117 120 121 123
125 126 127 128 130 131 132 133 137 138 139 141 143
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
2022-06-03 16:13:47: Failure - hand contact (25 scenes):
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
11 14 20 29 39 40 41 44 45 47 51 55 57 58 65 67 74 80 82 88
102 105 118 124 136
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
2022-06-03 16:13:47: Failure - object drop (17 scenes):
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
24 31 32 52 61 78 79 84 86 97 101 104 119 122 134 140 142
--- --- --- --- --- --- --- --- --- --- --- --- --- --- --- --- ---
2022-06-03 16:13:47: Failure - timeout (9 scenes):
--- --- --- --- --- --- --- --- ---
48 73 75 76 90 99 100 129 135
--- --- --- --- --- --- --- --- ---
2022-06-03 16:13:47: Evaluation complete.
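The percentages in the evaluation output above follow directly from the scene counts, and the four outcomes partition all 144 scenes. A small sketch that recomputes them from the counts reported in the log (the variable names are illustrative, not taken from handover-sim):

```python
# Scene counts copied from the evaluation log above.
counts = {
    "success": 93,
    "hand contact": 25,
    "object drop": 17,
    "timeout": 9,
}

# Every scene ends in exactly one outcome, so the counts sum to the total.
total = sum(counts.values())

# Percentage of scenes per outcome, rounded to two decimals as in the log.
rates = {name: round(100 * n / total, 2) for name, n in counts.items()}

print(f"total scenes: {total}")
for name, rate in rates.items():
    print(f"{name:>12}: {rate:5.2f}% ({counts[name]}/{total})")
```

The recomputed values match the table: 64.58% success, with failures split into 17.36% hand contact, 11.81% object drop, and 6.25% timeout.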

《CDLM: Cross-Document Language Modeling》(EMNLP 2021)

GitHub: github.com/aviclu/CDLM

You can either pretrain by yourself or use the pretrained CDLM model weights and tokenizer files, which are available on HuggingFace.

Then, use:

from transformers import AutoTokenizer, AutoModel
# load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained('biu-nlp/cdlm')
model = AutoModel.from_pretrained('biu-nlp/cdlm')

《Continual Learning for Task-Oriented Dialogue Systems》(EMNLP 2021)

GitHub: github.com/andreamad8/ToDCL

《Torsional Diffusion for Molecular Conformer Generation》(2022)

GitHub: github.com/gcorso/torsional-diffusion

《MMChat: Multi-Modal Chat Dataset on Social Media》(2022)

GitHub: github.com/silverriver/MMChat

《Can CNNs Be More Robust Than Transformers?》(2022)

GitHub: github.com/UCSC-VLAA/RobustCNN

《Revealing Single Frame Bias for Video-and-Language Learning》(2022)

GitHub: github.com/jayleicn/singularity

《Progressive Distillation for Fast Sampling of Diffusion Models》(2022)

GitHub: github.com/Hramchenko/diffusion_distiller

《Neural Basis Models for Interpretability》(2022)

GitHub: github.com/facebookresearch/nbm-spam

《Scalable Interpretability via Polynomials》(2022)

GitHub: github.com/facebookresearch/nbm-spam

《Infinite Recommendation Networks: A Data-Centric Approach》(2022)

GitHub: github.com/noveens/infinite_ae_cf

《The GatedTabTransformer. An enhanced deep learning architecture for tabular modeling》(2022)

GitHub: github.com/radi-cho/GatedTabTransformer

Usage:

import torch
import torch.nn as nn
from gated_tab_transformer import GatedTabTransformer

model = GatedTabTransformer(
    categories = (10, 5, 6, 5, 8),  # tuple containing the number of unique values within each category
    num_continuous = 10,            # number of continuous values
    transformer_dim = 32,           # dimension, paper set at 32
    dim_out = 1,                    # binary prediction, but could be anything
    transformer_depth = 6,          # depth, paper recommended 6
    transformer_heads = 8,          # heads, paper recommends 8
    attn_dropout = 0.1,             # post-attention dropout
    ff_dropout = 0.1,               # feed forward dropout
    mlp_act = nn.LeakyReLU(0),      # activation for final mlp, defaults to relu, but could be anything else (selu, etc.)
    mlp_depth=4,                    # mlp hidden layers depth
    mlp_dimension=32,               # dimension of mlp layers
    gmlp_enabled=True               # gmlp or standard mlp
)

x_categ = torch.randint(0, 5, (1, 5))  # category values, from 0 - max number of categories, in the order as passed into the constructor above
x_cont = torch.randn(1, 10)            # assume continuous values are already normalized individually

pred = model(x_categ, x_cont)
print(pred)

《Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition》(2022)

GitHub: github.com/yaoing/DAN

《Towards Principled Disentanglement for Domain Generalization》(2021)

GitHub: github.com/hlzhang109/DDG

《SoundStream: An End-to-End Neural Audio Codec》(2021)

GitHub: github.com/wesbz/SoundStream

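Please contact this official account for authorization before reposting.

Join the Computer Vision Research Institute study group!

The Computer Vision Research Institute works mainly in deep learning, focusing on face detection, face recognition, multi-object detection, object tracking, and image segmentation. The institute will keep sharing new algorithms and frameworks from the latest papers. What is different about this revamp is our emphasis on research: going forward we will share hands-on practice in each of these areas, so that readers can experience real-world scenarios beyond theory and build the habit of coding hands-on and thinking for themselves.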
