INDEX
Explanations
occurrences of the word "Contr" in various contexts
New Auto-Interp
Negative Logits
owie
-0.17
usercontent
-0.17
mun
-0.16
oldt
-0.16
çī
-0.16
verbatim
-0.15
trä
-0.15
oyer
-0.15
osi
-0.14
Ùħع
-0.14
POSITIVE LOGITS
ictory
0.18
ector
0.17
opposite
0.16
433
0.16
ional
0.15
wind
0.15
contr
0.15
egt
0.15
ors
0.15
cona
0.15
Activations Density 0.033%