INDEX
Explanations
highlights of individuals or personalities
repeated syllables in words or phrases
New Auto-Interp
Negative Logits
ICLE
-0.59
estial
-0.59
ibilities
-0.58
iop
-0.57
uo
-0.57
abouts
-0.55
idon
-0.55
iosity
-0.54
uten
-0.54
ometric
-0.53
POSITIVE LOGITS
ffer
0.63
Brach
0.61
FG
0.61
ĵĺ
0.59
ffe
0.59
FN
0.59
kefeller
0.58
legate
0.57
Var
0.57
æ©
0.55
Activations Density 0.127%