Explanations
"The" followed by specific entities
NEGATIVE LOGITS
entlichen     0.70
防水          0.61
leukin        0.57
단            0.57
adap          0.54
석            0.54
bestimmten    0.54
pract         0.53
기업          0.53
mű            0.53
POSITIVE LOGITS
country       0.81
Confederacy   0.74
situation     0.72
Philippines   0.69
United        0.63
collapse      0.63
aftermath     0.62
Netherlands   0.61
country       0.61
plight        0.61
Activations Density 0.149%