INDEX
Explanations
references to high-ranking institutions or authorities
New Auto-Interp
Negative Logits
vier
-0.75
arium
-0.75
rall
-0.74
oidal
-0.72
itte
-0.71
ious
-0.71
vernment
-0.70
OPLE
-0.69
okemon
-0.68
uish
-0.66
POSITIVE LOGITS
landers
1.23
lighting
1.12
lights
1.01
School
0.98
nesses
0.96
lander
0.93
light
0.93
school
0.91
Definition
0.83
ness
0.83
Activations Density 0.026%