INDEX
Explanations
phrases indicating a reaction or response to external circumstances
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.14
3:0.08
4:0.21
5:0.05
6:0.08
7:0.16
8:0.05
9:0.05
10:0.06
11:0.04
Negative Logits
Libre
-1.53
SPONSORED
-1.40
oided
-1.39
女
-1.37
alli
-1.36
afia
-1.35
LIB
-1.35
iversity
-1.29
milo
-1.27
Ended
-1.25
POSITIVE LOGITS
existing
1.46
izontal
1.40
ardon
1.38
umph
1.27
congrat
1.25
enhagen
1.21
bum
1.20
utterstock
1.19
respective
1.18
customary
1.17
Activations Density 0.000%