INDEX
Explanations
proper names of individuals
mentions of specific individuals or places
Negative Logits
rated     -1.00
rates     -0.89
ration    -0.78
rator     -0.76
leader    -0.73
gamer     -0.73
rators    -0.72
rate      -0.72
agents    -0.71
lar       -0.71
Positive Logits
istries     0.98
istry       0.89
ISO         0.85
icals       0.81
Tracy       0.80
Pratt       0.78
\\\\\\\\    0.78
aceous      0.78
izont       0.78
grain       0.77
Activations Density 0.022%