INDEX
Explanations
proper nouns related to people or places
mentions of the name "Nav" or related variations
New Auto-Interp
Negative Logits
chnology
-0.74
hyde
-0.74
gracious
-0.73
è£ı
-0.72
xual
-0.72
FORE
-0.71
Reloaded
-0.69
cruelty
-0.65
Ͻ
-0.64
contraception
-0.63
POSITIVE LOGITS
arro
1.39
igator
1.36
igating
1.27
igation
1.22
igators
1.22
ajo
1.21
igate
1.12
arre
1.10
arette
1.01
igated
0.99
Activations Density 0.009%