INDEX
Explanations
mentions of specific entities or organizations
references to prominent organizations and sports teams
New Auto-Interp
Negative Logits
bearing
-0.61
margin
-0.60
contained
-0.60
ashington
-0.59
abil
-0.58
alone
-0.58
izontal
-0.57
.''.
-0.56
COMPLE
-0.55
66666666
-0.55
POSITIVE LOGITS
guy
0.87
dudes
0.85
guys
0.80
vet
0.79
dude
0.77
psychologist
0.76
gentleman
0.72
vets
0.70
idiots
0.69
photographer
0.69
Activations Density 0.743%