INDEX
Explanations
adjectives that describe intensity or characteristics
descriptive words and terms related to film and societal themes
Negative Logits
OIL           -0.84
uther         -0.81
avia          -0.79
qqa           -0.79
redits        -0.78
besides       -0.76
chwitz        -0.76
Specific      -0.75
registered    -0.75
RF            -0.75
Positive Logits
nature        1.14
confines      1.06
USS           1.02
portion       1.00
aspect        0.93
masses        0.91
version       0.89
process       0.89
portions      0.88
world         0.87
Activations Density 0.332%