INDEX
Explanations
the word "Cap" with varying levels of importance or relevance
capitalized words or terms
New Auto-Interp
Negative Logits
hower
-0.91
ĪĴ
-0.77
IGHTS
-0.74
silence
-0.71
¿½
-0.71
anten
-0.65
gaard
-0.64
Nare
-0.63
darts
-0.63
bye
-0.61
POSITIVE LOGITS
itol
1.42
itals
1.27
acity
1.27
rice
1.27
abilities
1.26
rices
1.25
illary
1.21
uchin
1.16
rylic
1.11
itated
1.09
Activations Density 0.015%