INDEX
Explanations
mentions of organizational or governmental bodies
references to various organizational entities or groups
New Auto-Interp
Negative Logits
Kafka
-0.75
Job
-0.68
Dickens
-0.68
Hoover
-0.67
desperation
-0.65
edia
-0.62
kers
-0.62
Zip
-0.59
ophobic
-0.57
Typh
-0.57
POSITIVE LOGITS
guards
1.11
guard
0.90
builders
0.87
anguage
0.85
lain
0.81
building
0.79
weight
0.79
chair
0.78
politic
0.77
shed
0.76
Activations Density 0.017%