diff --git a/README.md b/README.md
index 89e7a9c..d88d10a 100644
--- a/README.md
+++ b/README.md
@@ -777,7 +777,7 @@ pred = mbvit_xs(img) # (1, 1000)
-This paper introduces the cross correlation attention (abbreviated XCA). One can think of it as doing attention across the features dimension rather than the spatial one (another perspective would be a dynamic 1x1 convolution, the kernel being attention map defined by spatial correlations).
+This paper introduces cross covariance attention (abbreviated XCA). One can think of it as doing attention across the feature dimension rather than the spatial one (another perspective is a dynamic 1x1 convolution, with the kernel being an attention map defined by spatial correlations).
Technically, this amounts to simply transposing the query, key, values before executing cosine similarity attention with learned temperature.
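
The description above can be sketched for a single head in NumPy. This is an illustrative sketch, not the repository's implementation: the function name `xca`, the `(n_tokens, dim)` input shapes, and the fixed scalar `temperature` are assumptions made for the example (in a real model the temperature would be a learned parameter).

```python
import numpy as np

def xca(q, k, v, temperature=1.0):
    """Single-head cross covariance attention (illustrative sketch).

    q, k, v: arrays of shape (n_tokens, dim).
    temperature: scalar scale on the similarities (assumed fixed here).
    """
    # Transpose so that features index the rows: (dim, n_tokens).
    q, k, v = q.T, k.T, v.T

    # L2-normalize each feature row over the token axis, so the dot
    # products below are cosine similarities.
    q = q / np.linalg.norm(q, axis=-1, keepdims=True)
    k = k / np.linalg.norm(k, axis=-1, keepdims=True)

    # (dim, dim) feature-to-feature similarities, scaled by the temperature.
    sim = (q @ k.T) * temperature

    # Softmax over the last axis, shifted for numerical stability.
    sim = sim - sim.max(axis=-1, keepdims=True)
    attn = np.exp(sim)
    attn = attn / attn.sum(axis=-1, keepdims=True)

    # Mix the feature rows of v, then transpose back to (n_tokens, dim).
    out = attn @ v
    return out.T, attn

rng = np.random.default_rng(0)
n_tokens, dim = 16, 8
q, k, v = (rng.standard_normal((n_tokens, dim)) for _ in range(3))
out, attn = xca(q, k, v, temperature=2.0)
print(out.shape, attn.shape)  # (16, 8) (8, 8)
```

Note that the attention map is `(dim, dim)` regardless of the number of tokens, which is why the cost of this step grows linearly with sequence length rather than quadratically.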