INDEX
Explanations
descriptors indicating loudness or intensity in various contexts
New Auto-Interp
Negative Logits
alue
-0.17
bare
-0.16
IBE
-0.15
itone
-0.15
imum
-0.14
asal
-0.14
reinterpret
-0.14
Sink
-0.14
andal
-0.13
ä»ĺãģį
-0.13
POSITIVE LOGITS
ness
0.35
NESS
0.24
enough
0.17
nes
0.17
halt
0.15
enes
0.15
ening
0.15
395
0.14
uden
0.14
Trad
0.14
Activations Density 0.038%