INDEX
Explanations
varied forms of verbs and noun phrases related to group actions and functions
Negative Logits
Frost    -0.17
ük       -0.16
ixed     -0.15
аÐ       -0.14
Ctx      -0.14
odd      -0.13
agas     -0.13
Vers     -0.13
ued      -0.13
rych     -0.13
Positive Logits
bate            0.16
iggs            0.16
artner          0.16
informational   0.16
pire            0.15
toi             0.15
erval           0.15
ditor           0.15
editorial       0.14
academic        0.14
Activations Density: 0.097%