INDEX
Explanations
prepositions followed by specific nouns or entities
references to "the" in various contexts
New Auto-Interp
Negative Logits
isson
-0.75
illes
-0.75
hari
-0.71
aram
-0.68
ohm
-0.67
ieve
-0.66
fman
-0.66
Edited
-0.66
ledged
-0.64
ptions
-0.64
POSITIVE LOGITS
sake
1.42
foreseeable
1.05
fledgling
0.99
entire
0.92
purposes
0.92
wearer
0.90
rest
0.87
nation
0.85
budding
0.85
nascent
0.84
Activations Density 0.252%