INDEX
Explanations
verbs or phrases related to providing support or validation
expressions related to support or endorsement
New Auto-Interp
Negative Logits
entric
-0.88
itizen
-0.74
orp
-0.69
inational
-0.68
ities
-0.68
icago
-0.68
ILCS
-0.66
nesota
-0.66
ptives
-0.66
anny
-0.65
POSITIVE LOGITS
track
1.04
ped
0.84
up
0.80
drive
0.76
stab
0.76
away
0.76
GROUND
0.75
lash
0.73
dash
0.72
tracking
0.71
Activations Density 0.047%