INDEX
Explanations
terms related to news articles reporting on events or situations involving people
references to individuals involved in incidents or events, particularly regarding crime and investigation
New Auto-Interp
Head Attr Weights
0:0.06
1:0.04
2:0.06
3:0.09
4:0.02
5:0.24
6:0.07
7:0.06
8:0.11
9:0.05
10:0.11
11:0.05
Negative Logits
latt
-1.16
ゴン
-1.11
Gil
-1.01
burner
-1.01
Potato
-1.01
plet
-1.01
Cannes
-1.00
mush
-0.97
Bliss
-0.97
Mell
-0.96
POSITIVE LOGITS
ardless
1.07
shouldn
1.07
Crime
1.06
nob
1.05
Net
1.05
1.04
seless
1.04
BER
1.03
rac
1.00
eren
1.00
Activations Density 0.237%