INDEX
Explanations
the word "sup" in various forms and contexts
New Auto-Interp
Negative Logits
abyrin
-0.16
eil
-0.16
yalty
-0.15
IPS
-0.15
oui
-0.14
èĹ
-0.14
zon
-0.14
ityEngine
-0.14
zes
-0.14
tod
-0.14
POSITIVE LOGITS
reme
0.37
plement
0.36
posed
0.36
ervised
0.36
ervisor
0.36
plied
0.35
plementary
0.35
pression
0.34
plies
0.34
pose
0.33
Activations Density 0.016%