INDEX
Explanations
entities related to locations and organizations
Stone Bridge, Raising Happiness, evil Anton
New Auto-Interp
Negative Logits
,
-0.41
ab
-0.36
<eos>
-0.33
other
-0.33
because
-0.33
or
-0.32
جم
-0.32
al
-0.31
ne
-0.31
im
-0.31
POSITIVE LOGITS
pleaſure
0.76
+#+#
0.74
leſs
0.73
houſe
0.73
faſt
0.71
Theſe
0.70
ſta
0.69
ſeine
0.65
queſta
0.65
avoient
0.64
Activations Density 0.062%