INDEX
Explanations
words related to linguistic and computational terms, specifically words ending in '-ing', '-ed', '-ly', '-er', or '-s'
words related to linguistic forms and suffixes
New Auto-Interp
Negative Logits
KP
-0.70
schild
-0.69
iewicz
-0.67
Hayden
-0.65
Sutherland
-0.63
Immunity
-0.63
Helsinki
-0.62
Luk
-0.61
TAM
-0.60
Elves
-0.60
POSITIVE LOGITS
mble
1.00
cean
0.92
moil
0.87
uin
0.85
gment
0.82
renheit
0.81
rill
0.80
mph
0.80
odox
0.79
ased
0.78
Activations Density 0.168%