INDEX
Explanations
expressions related to legal or illegal activities
New Auto-Interp
Head Attr Weights
0:0.05
1:0.03
2:0.09
3:0.08
4:0.06
5:0.23
6:0.04
7:0.07
8:0.06
9:0.13
10:0.07
11:0.03
Negative Logits
†
-2.48
Limit
-2.43
antim
-2.31
MB
-2.20
taboola
-2.18
derog
-2.18
�
-2.17
TM
-2.17
MA
-2.10
ICC
-2.06
POSITIVE LOGITS
Then
3.42
awoke
3.36
Eventually
3.33
hadn
3.30
sprang
3.25
Suddenly
3.25
then
3.01
woke
2.94
didn
2.90
toggle
2.88
Activations Density 0.099%