INDEX
Explanations
evaluative and comparative adjectives related to size and significance
New Auto-Interp
Negative Logits
ROM
-0.15
fik
-0.15
uj
-0.15
Böyle
-0.14
anj
-0.14
achten
-0.14
_SIG
-0.14
á»ijt
-0.14
Oyun
-0.13
ract
-0.13
POSITIVE LOGITS
yet
0.19
yet
0.18
possible
0.17
possible
0.17
Yet
0.16
aller
0.15
amongst
0.15
imaginable
0.15
åİ
0.15
ाà¤Ĭ
0.15
Activations Density 0.409%