INDEX
Explanations
occurrences of the plural form of a noun
New Auto-Interp
Negative Logits
sacrific
-0.64
Sci
-0.61
contrace
-0.61
hither
-0.61
proposition
-0.60
rama
-0.60
Cruiser
-0.58
Herod
-0.58
Ottoman
-0.57
disadvant
-0.56
POSITIVE LOGITS
keleton
1.00
leeve
0.96
ixty
0.96
uggest
0.95
iblings
0.95
atisf
0.92
ptions
0.91
ouls
0.91
istance
0.90
ket
0.88
Activations Density 0.102%