INDEX
Explanations
references to the Dalai Lama
references to influential leaders, specifically the Dalai Lama and Lenin
New Auto-Interp
Negative Logits
hire
-0.82
reek
-0.81
ties
-0.74
IED
-0.74
ptroller
-0.71
tes
-0.68
EAR
-0.67
aved
-0.67
Rangers
-0.67
tle
-0.65
POSITIVE LOGITS
Lama
1.09
Jinping
0.95
EStream
0.84
utra
0.82
onite
0.78
seiz
0.78
eus
0.77
istani
0.76
DragonMagazine
0.75
Patriarch
0.74
Activations Density 0.013%