INDEX
Explanations
words that reflect actions or processes, particularly those ending in 'ing' and variations of the word 'way'
New Auto-Interp
Negative Logits
ÃŃky
-0.16
ered
-0.14
orns
-0.14
ng
-0.14
cion
-0.14
iek
-0.14
etak
-0.14
-fontawesome
-0.13
gel
-0.13
(^
-0.13
POSITIVE LOGITS
597
0.16
CLUB
0.15
ittest
0.15
èįī
0.15
óst
0.14
برÛĮ
0.14
verb
0.14
bail
0.14
uba
0.13
club
0.13
Activations Density 0.006%