INDEX
Explanations
terms associated with friendliness and camaraderie
New Auto-Interp
Negative Logits
-equipped
-0.20
-trigger
-0.16
ovu
-0.16
ulos
-0.15
sharp
-0.15
urette
-0.15
otti
-0.15
kova
-0.15
izada
-0.15
urator
-0.14
POSITIVE LOGITS
ly
0.83
LY
0.56
liness
0.55
liest
0.46
lys
0.43
lier
0.40
lyph
0.39
hood
0.39
lies
0.36
ely
0.33
Activations Density 0.073%