INDEX
Explanations
references to significant legal or criminal incidents
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.03
3:0.10
4:0.03
5:0.04
6:0.02
7:0.03
8:0.09
9:0.06
10:0.07
11:0.41
Negative Logits
ason
-1.88
princ
-1.86
eper
-1.74
Camel
-1.64
veter
-1.60
Railroad
-1.59
flashlight
-1.58
railroad
-1.57
garbage
-1.54
runs
-1.47
POSITIVE LOGITS
":[{"1.86
rawdownloadcloneembedreportprint
1.84
Others
1.73
forgiven
1.72
agonists
1.70
Gender
1.65
::::::::
1.63
MENTS
1.58
####
1.58
ゼウス
1.57
Activations Density 0.087%