INDEX
Explanations
mentions of a specific name, possibly "Kamala"
the repeated mention of the term "ala," which likely refers to a specific topic or entity in the text
New Auto-Interp
Negative Logits
dit
-0.87
LY
-0.85
lines
-0.84
line
-0.83
ly
-0.83
lessly
-0.82
lers
-0.81
lies
-0.80
liness
-0.79
leness
-0.79
POSITIVE LOGITS
uthor
1.08
ibaba
1.02
emia
0.96
Pradesh
0.96
isa
0.90
qua
0.86
Haram
0.82
velength
0.81
ista
0.81
pha
0.78
Activations Density 0.017%