INDEX
Explanations
references to governmental or institutional entities and their actions
phrases related to causes or explanations
Negative Logits
thood    -0.91
heit     -0.89
strate   -0.79
ature    -0.78
etsy     -0.73
ifies    -0.71
vier     -0.70
ivo      -0.69
ibi      -0.69
jer      -0.68
Positive Logits
sheer            1.14
aforementioned   1.12
fact             1.10
absence          1.02
influx           1.02
emergence        1.02
latter           1.01
likes            1.01
presence         1.00
availability     0.99
Activations Density: 0.333%