Commit Graph

78 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Phil Wang | 78489045cd | readme | 2021-03-09 19:23:09 -08:00 |
| Phil Wang | 173e07e02e | cleanup and release 0.8.0 (tag: 0.8.0) | 2021-03-08 07:28:31 -08:00 |
| Phil Wang | 0e63766e54 | Merge pull request #66 from zankner/masked_patch_pred: Masked Patch Prediction (suggested in #63; work in progress) | 2021-03-08 07:21:52 -08:00 |
| Zack Ankner | a6cbda37b9 | added to readme | 2021-03-08 09:34:55 -05:00 |
| Zack Ankner | 73de1e8a73 | converting bin targets to hard labels | 2021-03-07 12:19:30 -05:00 |
| Phil Wang | 1698b7bef8 | make it so one can plug Performer into T2T-ViT (tag: 0.7.6) | 2021-02-25 20:55:34 -08:00 |
| Phil Wang | 6760d554aa | no need to do projection to combine attention heads for T2T's initial one-headed attention layers (tag: 0.7.4) | 2021-02-24 12:23:39 -08:00 |
| Phil Wang | a82894846d | add DistillableT2TViT | 2021-02-21 19:54:45 -08:00 |
| Phil Wang | 3744ac691a | remove patch size from T2TViT | 2021-02-21 19:15:19 -08:00 |
| Phil Wang | 6af7bbcd11 | make sure distillation still works (tag: 0.7.1) | 2021-02-21 19:08:18 -08:00 |
| Phil Wang | 05edfff33c | cleanup | 2021-02-20 11:32:38 -08:00 |
| Phil Wang | e3205c0a4f | add Token-to-Token ViT | 2021-02-19 22:28:53 -08:00 |
| Phil Wang | 4fc7365356 | incept idea for using Nystromformer | 2021-02-17 15:30:45 -08:00 |
| Phil Wang | 3f2cbc6e23 | fix for ambiguity in broadcasting mask (tag: 0.6.8) | 2021-02-17 07:38:11 -08:00 |
| Zack Ankner | fc14561de7 | made bit boundaries a function of output bits and max pixel value, fixed spelling error, and reset vit_pytorch to original file | 2021-02-13 18:19:21 -07:00 |
| Zack Ankner | be5d560821 | MPP loss is now based on discretized average pixels; ViT forward unchanged | 2021-02-12 18:30:56 -07:00 |
| Zack Ankner | 77703ae1fc | moving MPP loss into wrapper | 2021-02-10 21:47:49 -07:00 |
| Zack Ankner | a0a4fa5e7d | working implementation of masked patch prediction as a wrapper; need to clean code up | 2021-02-09 22:55:06 -07:00 |
| Zack Ankner | 174e71cf53 | wrapper for masked patch prediction; built handling of input and masking of patches; need to work on integrating into ViT forward call and MPP loss function | 2021-02-07 16:49:06 -05:00 |
| Zack Ankner | e14bd14a8f | preliminary work on masked patch prediction for self-supervision | 2021-02-04 22:00:02 -05:00 |
| Phil Wang | 85314cf0b6 | patch for scaling factor, thanks to @urkax (tag: 0.6.7) | 2021-01-21 09:39:42 -08:00 |
| Phil Wang | 5db8d9deed | update readme about non-square images | 2021-01-12 06:55:45 -08:00 |
| Phil Wang | e8ca6038c9 | allow for DistillableViT to still run predictions (tag: 0.6.5) | 2021-01-11 10:49:14 -08:00 |
| Phil Wang | 1106a2ba88 | link to official repo | 2021-01-08 08:23:50 -08:00 |
| Phil Wang | f95fa59422 | link to resources for vision people | 2021-01-04 10:10:54 -08:00 |
| Phil Wang | be1712ebe2 | add quote | 2020-12-28 10:22:59 -08:00 |
| Phil Wang | 1a76944124 | update readme | 2020-12-27 19:10:38 -08:00 |
| Phil Wang | 2263b7396f | allow distillable efficient ViT to restore efficient ViT as well (tag: 0.6.4) | 2020-12-25 19:31:25 -08:00 |
| Phil Wang | 74074e2b6c | offer easy way to turn DistillableViT into ViT at the end of training (tag: 0.6.3) | 2020-12-25 11:16:52 -08:00 |
| Phil Wang | 0c68688d61 | bump for release | 2020-12-25 09:30:48 -08:00 |
| Phil Wang | 5918f301a2 | cleanup | 2020-12-25 09:30:38 -08:00 |
| Phil Wang | 4a6469eecc | Merge pull request #51 from umbertov/main: Add class for distillation with efficient attention | 2020-12-25 09:21:17 -08:00 |
| Umberto Valleriani | 5a225c8e3f | Add class for distillation with efficient attention. DistillableEfficientViT does the same as DistillableViT, except it may accept a custom transformer encoder, possibly implementing an efficient attention mechanism | 2020-12-25 13:46:29 +01:00 |
| Phil Wang | e0007bd801 | add distill diagram | 2020-12-24 11:34:15 -08:00 |
| Phil Wang | db98ed7a8e | allow for overriding alpha as well on forward in distillation wrapper (tag: 0.6.2) | 2020-12-24 11:18:36 -08:00 |
| Phil Wang | dc4b3327ce | no grad for teacher in distillation | 2020-12-24 11:11:58 -08:00 |
| Phil Wang | aa8f0a7bf3 | Update README.md | 2020-12-24 10:59:03 -08:00 |
| Phil Wang | 34e6284f95 | Update README.md | 2020-12-24 10:58:41 -08:00 |
| Phil Wang | aa9ed249a3 | add knowledge distillation with distillation tokens, in light of new finding from Facebook AI (tag: 0.6.0) | 2020-12-24 10:39:15 -08:00 |
| Phil Wang | ea0924ec96 | update readme | 2020-12-23 19:06:48 -08:00 |
| Phil Wang | 59787a6b7e | allow for mean pool with efficient version too (tag: 0.5.1) | 2020-12-23 18:15:40 -08:00 |
| Phil Wang | 24339644ca | offer a way to use mean pooling of last layer (tag: 0.5.0) | 2020-12-23 17:23:58 -08:00 |
| Phil Wang | b786029e18 | fix the dimension per head to be independent of dim and heads, to make sure users do not have it be too small to learn anything (tag: 0.4.0) | 2020-12-17 07:43:52 -08:00 |
| Phil Wang | 9624181940 | simplify MLP head (tag: 0.3.0) | 2020-12-07 14:31:50 -08:00 |
| Phil Wang | a656a213e6 | update diagram | 2020-12-04 12:26:28 -08:00 |
| Phil Wang | f1deb5fb7e | Merge pull request #31 from minhlong94/main: Update README and documentation | 2020-11-21 08:05:38 -08:00 |
| Long M. Lưu | 3f50dd72cf | Update README.md | 2020-11-21 18:37:03 +07:00 |
| Long M. Lưu | ee5e4e9929 | Update vit_pytorch.py | 2020-11-21 18:23:04 +07:00 |
| Phil Wang | 6c8dfc185e | remove float(-inf) as masking value (tag: 0.2.7) | 2020-11-13 12:25:21 -08:00 |
| Phil Wang | 4f84ad7a64 | authors are now known | 2020-11-03 14:28:20 -08:00 |