INDEX
Explanations
mentions of cats and related terms
Negative Logits
PYX                -0.73
Aguilera           -0.68
avax               -0.68
makeConstraints    -0.66
près               -0.64
Αυ                 -0.62
----------         -0.61
trouw              -0.60
trong              -0.59
떻                 -0.59
Positive Logits
cat                2.62
Cat                2.51
cats               2.50
Cat                2.34
Cats               2.33
Cats               2.24
cat                2.21
CAT                2.11
cats               2.03
CAT                1.89
Activations Density 0.073%