Phil Wang
efb94608ea
readme
2023-10-19 09:38:35 -07:00
Phil Wang
1616288e30
add xcit (#284)
* add xcit
* use Rearrange layers
* give cross correlation transformer a final norm at end
* document
2023-10-13 09:15:13 -07:00
Jason Chou
9e1e824385
Update README.md (#283)
`patch_size` is size of patches, not number of patches
2023-10-09 11:33:56 -07:00
lucidrains
a36546df23
add simple vit with register tokens example, cite
2023-10-01 08:11:40 -07:00
Phil Wang
6e2393de95
wrap up NaViT
2023-07-25 10:38:55 -07:00
Phil Wang
17675e0de4
add constant token dropout for NaViT
2023-07-24 14:14:36 -07:00
Phil Wang
598cffab53
release NaViT
2023-07-24 13:55:54 -07:00
Phil Wang
23820bc54a
begin work on NaViT (#273)
finish core idea of NaViT
2023-07-24 13:54:02 -07:00
Phil Wang
c59843d7b8
add a version of simple vit using flash attention
2023-03-18 09:41:39 -07:00
lucidrains
9a8e509b27
separate a simple vit from mp3, so that simple vit can be used after being pretrained
2023-03-07 19:31:10 -08:00
Srikumar Sastry
4218556acd
Add Masked Position Prediction (#260)
* Create mp3.py
* Implementation: Position Prediction as an Effective Pretraining Strategy
* Added description for Masked Position Prediction
* MP3 image added
2023-03-07 14:28:40 -08:00
Phil Wang
f621c2b041
typo
2023-03-04 20:30:02 -08:00
Phil Wang
bdaf2d1491
adopt dual patchnorm paper for as many vit as applicable, release 1.0.0
2023-02-03 08:11:29 -08:00
Phil Wang
89e1996c8b
add vit with patch dropout, fully embrace structured dropout as multiple papers are now corroborating each other
2022-12-02 11:28:11 -08:00
Phil Wang
cb6d749821
add a 3d version of cct, addressing https://github.com/lucidrains/vit-pytorch/issues/238 0.38.1
2022-10-29 11:35:06 -07:00
Phil Wang
13fabf901e
add vivit
2022-10-24 09:34:04 -07:00
Ryan Russell
c0eb4c0150
Improving Readability (#220)
Signed-off-by: Ryan Russell <git@ryanrussell.org>
2022-10-17 10:42:45 -07:00
Phil Wang
b4853d39c2
add the 3d simple vit
2022-10-16 20:45:30 -07:00
Phil Wang
29fbf0aff4
begin extending some of the architectures over to 3d, starting with basic ViT
2022-10-16 15:31:59 -07:00
Phil Wang
4b8f5bc900
add link to Flax translation by @conceptofmind
2022-07-27 08:58:18 -07:00
Phil Wang
4e62e5f05e
make extractor flexible for layers that output multiple tensors, show CrossViT example
2022-06-19 08:11:41 -07:00
Phil Wang
b3e90a2652
add simple vit, from https://arxiv.org/abs/2205.01580
2022-05-03 20:24:14 -07:00
Phil Wang
4ef72fc4dc
add EsViT, by popular request, an alternative to Dino that is compatible with efficient ViTs with accounting for regional self-supervised loss
2022-05-03 10:29:29 -07:00
Zhengzhong Tu
c2aab05ebf
fix bibtex typo (#212)
2022-04-06 22:15:05 -07:00
Phil Wang
2d4089c88e
link to maxvit in readme
2022-04-06 16:24:12 -07:00
Phil Wang
c7bb5fc43f
maxvit intent to build (#211)
complete hybrid mbconv + block / grid efficient self attention MaxViT
2022-04-06 16:12:17 -07:00
Phil Wang
d65a742efe
intent to build (#210)
complete SepViT, from bytedance AI labs
2022-03-31 14:30:23 -07:00
Phil Wang
df656fe7c7
complete learnable memory ViT, for efficient fine-tuning and potentially plays into continual learning
2022-03-31 09:51:12 -07:00
Phil Wang
6d7298d8ad
link to tensorflow2 translation by @taki0112
2022-03-28 09:05:34 -07:00
Phil Wang
9cd56ff29b
CCT allow for rectangular images
2022-03-26 14:02:49 -07:00
Phil Wang
2aae406ce8
add proposed parallel vit from facebook ai for exploration purposes
2022-03-23 10:42:35 -07:00
Phil Wang
d27721a85a
add scalable vit, from bytedance AI
2022-03-22 17:02:47 -07:00
Phil Wang
6db20debb4
add patch merger
2022-03-01 16:50:17 -08:00
Phil Wang
126d204ff2
fix block repeats in readme example for Nest
2022-01-22 21:32:53 -08:00
Phil Wang
891b92eb74
readme
2021-12-28 16:00:00 -08:00
Phil Wang
70ba532599
add ViT for small datasets https://arxiv.org/abs/2112.13492
2021-12-28 10:58:21 -08:00
Phil Wang
2c368d1d4e
add extractor wrapper
2021-12-21 11:11:39 -08:00
Phil Wang
b983bbee39
release MobileViT, from @murufeng
2021-12-21 10:22:59 -08:00
murufeng
e7075c64aa
Update README.md
2021-12-21 20:44:30 +08:00
Phil Wang
f4b0b14094
add ATS to table of contents
2021-12-03 20:07:18 -08:00
Phil Wang
365b4d931e
add adaptive token sampling paper
2021-12-03 19:52:40 -08:00
Phil Wang
79c864d796
link to community youtuber
2021-11-24 08:13:52 -08:00
Phil Wang
d35345df6a
remove wip
2021-11-22 17:43:04 -08:00
Phil Wang
36e32b70fb
complete and release crossformer
2021-11-22 17:10:53 -08:00
Phil Wang
de0b8ba189
additional diagram
2021-11-22 14:05:39 -08:00
Phil Wang
5b2382f9f0
intent to add
2021-11-22 12:00:03 -08:00
Phil Wang
5ae555750f
add SimMIM
2021-11-21 15:50:19 -08:00
ankandrew
e212918e2d
add Table of Contents
2021-11-17 21:21:19 -03:00
Phil Wang
99c44cf5f6
readme
2021-11-14 11:49:12 -08:00
Phil Wang
5b16e8f809
readme
2021-11-12 20:19:38 -08:00