Phil Wang | 719048d1bd | some better defaults for scalable vit | 0.28.1 | 2022-03-22 17:19:58 -07:00 |
Phil Wang | d27721a85a | add scalable vit, from bytedance AI | 0.28.0 | 2022-03-22 17:02:47 -07:00 |
Phil Wang | cb22cbbd19 | update to einops 0.4, which is torchscript jit friendly | 0.27.1 | 2022-03-22 13:58:00 -07:00 |
Phil Wang | 6db20debb4 | add patch merger | 0.27.0 | 2022-03-01 16:50:17 -08:00 |
Phil Wang | 1bae5d3cc5 | allow for rectangular images for efficient adapter | 0.26.7 | 2022-01-31 08:55:31 -08:00 |
Phil Wang | 25b384297d | return None from extractor if no attention layers | 0.26.6 | 2022-01-28 17:49:58 -08:00 |
Phil Wang | 64a07f50e6 | epsilon should be inside square root | 0.26.5 | 2022-01-24 17:24:41 -08:00 |
Phil Wang | 126d204ff2 | fix block repeats in readme example for Nest | | 2022-01-22 21:32:53 -08:00 |
Phil Wang | c1528acd46 | fix feature maps in Nest, thanks to @MarkYangjiayi | 0.26.4 | 2022-01-22 13:17:30 -08:00 |
Phil Wang | 1cc0f182a6 | decoder positional embedding needs to be reapplied https://twitter.com/giffmana/status/1479195631587631104 | | 2022-01-06 13:14:41 -08:00 |
Phil Wang | 28eaba6115 | 0.26.2 | 0.26.2 | 2022-01-03 12:56:34 -08:00 |
Phil Wang | 0082301f9e | build @jrounds suggestion | | 2022-01-03 12:56:25 -08:00 |
Phil Wang | 91ed738731 | 0.26.1 | 0.26.1 | 2021-12-30 19:31:26 -08:00 |
Phil Wang | 1b58daa20a | Merge pull request #186 from chinhsuanwu/mobilevit: Update MobileViT | | 2021-12-30 19:31:01 -08:00 |
chinhsuanwu | f2414b2c1b | Update MobileViT | | 2021-12-30 05:52:23 +08:00 |
Phil Wang | 891b92eb74 | readme | | 2021-12-28 16:00:00 -08:00 |
Phil Wang | 70ba532599 | add ViT for small datasets https://arxiv.org/abs/2112.13492 | | 2021-12-28 10:58:21 -08:00 |
Phil Wang | e52ac41955 | allow extractor to only return embeddings, to ready for vision transformers to be used in x-clip | 0.25.6 | 2021-12-25 12:31:21 -08:00 |
Phil Wang | 0891885485 | include tests in package for conda | 0.25.5 | 2021-12-22 12:44:29 -08:00 |
Phil Wang | 976f489230 | add some tests | 0.25.3 | 2021-12-22 09:13:31 -08:00 |
Phil Wang | 2c368d1d4e | add extractor wrapper | 0.25.1 | 2021-12-21 11:11:39 -08:00 |
Phil Wang | b983bbee39 | release MobileViT, from @murufeng | | 2021-12-21 10:22:59 -08:00 |
Phil Wang | 86a7302ba6 | Merge pull request #181 from murufeng/main: Add MobileViT | | 2021-12-21 09:51:56 -08:00 |
murufeng | 89d3a04b3f | Add files via upload | | 2021-12-21 20:48:34 +08:00 |
murufeng | e7075c64aa | Update README.md | | 2021-12-21 20:44:30 +08:00 |
murufeng | 5ea1559e4c | Add files via upload | | 2021-12-21 20:41:01 +08:00 |
Phil Wang | f4b0b14094 | add ATS to table of contents | | 2021-12-03 20:07:18 -08:00 |
Phil Wang | 365b4d931e | add adaptive token sampling paper | 0.24.3 | 2021-12-03 19:52:40 -08:00 |
Phil Wang | 79c864d796 | link to community youtuber | | 2021-11-24 08:13:52 -08:00 |
Phil Wang | b45c1356a1 | cleanup | | 2021-11-22 22:53:02 -08:00 |
Phil Wang | ff44d97cb0 | make initial channels customizable for PiT | 0.24.2 | 2021-11-22 18:08:49 -08:00 |
Phil Wang | d35345df6a | remove wip | | 2021-11-22 17:43:04 -08:00 |
Phil Wang | b69b5af34f | dynamic positional bias for crossformer the more efficient way as described in appendix of paper | 0.24.1 | 2021-11-22 17:39:36 -08:00 |
Phil Wang | 36e32b70fb | complete and release crossformer | 0.24.0 | 2021-11-22 17:10:53 -08:00 |
Phil Wang | 768e47441e | crossformer without dynamic position bias | | 2021-11-22 16:21:55 -08:00 |
Phil Wang | de0b8ba189 | additional diagram | | 2021-11-22 14:05:39 -08:00 |
Phil Wang | 6665fc6cd1 | cleanup region vit | | 2021-11-22 12:42:24 -08:00 |
Phil Wang | 5b2382f9f0 | intent to add | | 2021-11-22 12:00:03 -08:00 |
Phil Wang | 9f8c60651d | clearer mae | | 2021-11-22 10:19:48 -08:00 |
Phil Wang | 5ae555750f | add SimMIM | 0.23.2 | 2021-11-21 15:50:19 -08:00 |
Phil Wang | c5a461661c | Merge pull request #170 from ankandrew/patch-1: add Table of Contents | | 2021-11-17 16:55:09 -08:00 |
ankandrew | e212918e2d | add Table of Contents | | 2021-11-17 21:21:19 -03:00 |
Phil Wang | dc57c75478 | cleanup | | 2021-11-14 12:24:48 -08:00 |
Phil Wang | 99c44cf5f6 | readme | | 2021-11-14 11:49:12 -08:00 |
Phil Wang | 5b16e8f809 | readme | | 2021-11-12 20:19:38 -08:00 |
Phil Wang | e8f6d72033 | release masked autoencoder | 0.22.0 | 2021-11-12 20:08:48 -08:00 |
Phil Wang | cb1729af28 | more efficient feedforward for regionvit | 0.21.1 | 2021-11-07 17:18:59 -08:00 |
Phil Wang | 9e50b2a41e | readme | | 2021-11-07 09:59:49 -08:00 |
Phil Wang | 06d375351e | add RegionViT paper | 0.21.0 | 2021-11-07 09:47:28 -08:00 |
Phil Wang | f196d1ec5b | move freqs in RvT to linspace | 0.20.8 | 2021-10-05 09:23:44 -07:00 |