INDEX
Explanations
references to quantitative changes or variations
Negative Logits
Token       Logit
ยม          -0.16
obia        -0.15
uma         -0.14
層          -0.14
unner       -0.14
andalone    -0.13
kees        -0.13
ipop        -0.13
pronto      -0.13
intendo     -0.13
Positive Logits
Token       Logit
slight      0.83
slightly    0.76
minor       0.57
slightest   0.52
mildly      0.50
mild        0.50
Minor       0.47
minor       0.45
Minor       0.44
modest      0.43
Activations Density: 0.327%
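The page does not say how the density figure is computed. A minimal sketch, assuming the common convention that density is the fraction of sampled tokens on which the feature's activation is above zero: the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and chosen only to illustrate what a value of 0.327% would mean under that assumption.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density counts tokens with activation > 0; the source page
    does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (made up for illustration): 327 active tokens out of 100,000.
sample = [1.0] * 327 + [0.0] * (100_000 - 327)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, the feature fires on roughly 3 of every 1,000 tokens, which is consistent with a sparse feature tied to a narrow family of words such as "slight" and "minor".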