INDEX
Explanations
superlative adjectives and ordinal numbers
terms related to firsts or notable mentions
beginnings or main components
New Auto-Interp
Negative Logits
als
-0.40
only
-0.36
sed
-0.36
於
-0.36
tr
-0.35
yarnpkg
-0.35
Only
-0.35
,
-0.35
din
-0.35
kaynağından
-0.34
POSITIVE LOGITS
itſelf
0.87
requirement
0.84
pleaſure
0.82
thing
0.81
numberWith
0.79
requirement
0.78
resourceCulture
0.77
mistake
0.75
ingredient
0.75
ſtate
0.75
Activations Density 2.357%