INDEX
Explanations
adjectives and adverbs expressing varying degrees of intensity or emphasis
New Auto-Interp
Head Attr Weights
0:0.12
1:0.04
2:0.07
3:0.15
4:0.07
5:0.10
6:0.03
7:0.07
8:0.05
9:0.06
10:0.13
11:0.06
Negative Logits
Kessler
-0.97
Pandora
-0.82
プ
-0.81
spray
-0.80
Insight
-0.79
Wong
-0.78
Invisible
-0.78
clipping
-0.78
IRC
-0.77
Cir
-0.77
POSITIVE LOGITS
atto
1.14
iga
1.07
auld
0.99
athed
0.96
nered
0.94
mercial
0.94
VALUE
0.93
mented
0.93
xual
0.92
igan
0.92
Activations Density 0.142%