INDEX
Explanations
phrases or references related to significant events or actions affecting individuals
Head Attr Weights
0:0.07
1:0.10
2:0.07
3:0.09
4:0.08
5:0.07
6:0.09
7:0.09
8:0.06
9:0.06
10:0.08
11:0.07
Negative Logits
��              -2.34
ebook           -2.33
mallow          -2.25
Lady            -2.20
Pigs            -2.15
Sheila          -2.10
Maiden          -2.07
Idol            -2.01
Singer          -1.95
shaved          -1.95
Positive Logits
concess         2.49
coordinates     2.35
extingu         2.25
rehend          2.23
nit             2.12
░               2.09
detect          2.05
TPPStreamerBot  2.01
areth           2.00
hydrogen        1.93
Activations Density 0.000%