Explanations
names of individuals
mentions of specific individuals, particularly their names
Negative Logits
xual         -1.01
clad         -0.78
fracture     -0.73
clad         -0.73
oulos        -0.68
permitting   -0.68
fracturing   -0.65
eleph        -0.62
situ         -0.62
totality     -0.61
Positive Logits
rie          0.92
arten        0.91
glers        0.87
ienne        0.86
arant        0.85
linger       0.84
dp           0.84
enne         0.83
rik          0.83
olf          0.83
Activations Density 0.019%