Explanations
proper nouns or specific terms associated with locations or events, potentially related to history or news
references to specific Greek terms or phrases
Negative Logits
philos         -0.85
erv            -0.81
ipeg           -0.76
ensical        -0.73
FANTASY        -0.71
oyer           -0.70
Enlightenment  -0.68
irtual         -0.68
ify            -0.67
oppable        -0.66
Positive Logits
duct           0.73
Astron         0.67
qs             0.63
Crow           0.63
drill          0.63
Quantity       0.62
phies          0.60
Ba             0.59
bere           0.59
verb           0.59
Activations Density 0.000%