INDEX
Explanations
common prepositions and connecting words
New Auto-Interp
Negative Logits
magnification
-0.66
compe
-0.63
[*
-0.62
phrine
-0.59
URA
-0.58
represent
-0.58
psc
-0.58
Pse
-0.57
anus
-0.57
hospitality
-0.56
POSITIVE LOGITS
agos
0.98
wine
0.98
odor
0.89
reau
0.87
ÃŃn
0.85
ieri
0.82
cci
0.80
Ferry
0.80
wald
0.79
ondo
0.79
Activations Density 0.021%