INDEX
Explanations
phrases indicating contact or interaction with others
instances of the phrase "getting in" or variations thereof
Negative Logits
Token       Logit
brim        -0.57
ALD         -0.57
EMP         -0.55
silhou      -0.53
Bullets     -0.53
pall        -0.53
Scand       -0.52
=]          -0.52
cia         -0.52
husbands    -0.51
Positive Logits
Token       Logit
ordinate    0.97
offensive   0.93
touch       0.88
byss        0.85
roads       0.83
bred        0.82
ked         0.81
hibited     0.80
Touch       0.79
utters      0.79
Activations Density: 0.055%