INDEX
Explanations
phrases indicating belonging or being part of a group or community
phrases that emphasize belonging or being part of a group
New Auto-Interp
Negative Logits
assumes
-0.66
ceilings
-0.65
ptive
-0.65
incurred
-0.63
withd
-0.62
directs
-0.61
spouses
-0.60
culosis
-0.60
iasis
-0.59
ants
-0.58
POSITIVE LOGITS
¬¼
0.72
the
0.70
Team
0.70
İĭ
0.68
Team
0.68
axy
0.67
ģĸ
0.67
something
0.66
Kinnikuman
0.65
circle
0.64
Activations Density 0.101%