INDEX
Explanations
references to a specific person named Kim Jong
references to Kim Jong Un and related figures
New Auto-Interp
Negative Logits
rawdownloadcloneembedreportprint
-0.79
ANT
-0.74
ATTLE
-0.69
arians
-0.69
WORK
-0.69
Results
-0.67
VID
-0.67
MENTS
-0.65
antic
-0.65
BACK
-0.64
POSITIVE LOGITS
ongyang
1.01
sung
0.92
Jong
0.90
Suk
0.83
unta
0.81
resy
0.81
jong
0.77
Jinping
0.77
lasses
0.77
jri
0.77
Activations Density 0.016%