INDEX
Explanations
phrases related to personal matters or personal actions
references to personal information or matters
Negative Logits
xual      -0.88
Faster    -0.81
Swarm     -0.80
rica      -0.74
Clever    -0.73
GGGG      -0.73
IVERS     -0.71
Survive   -0.71
Ingram    -0.71
Spac      -0.71
Positive Logits
injury      0.87
ised        0.84
dealings    0.81
pronouns    0.80
belongings  0.79
adviser     0.79
liberties   0.79
liberty     0.78
polit       0.77
afort       0.77
Activations Density: 0.019%