INDEX
Explanations
words related to viewpoint or interpretation
references to varying viewpoints or perceptions
NEGATIVE LOGITS
Token           Logit
avis            -0.77
liam            -0.74
nard            -0.68
nar             -0.68
cakes           -0.68
die             -0.67
ertodd          -0.66
icit            -0.65
Interstitial    -0.65
ldon            -0.64
POSITIVE LOGITS
Token           Logit
perspectives    0.96
perspective     0.92
viewpoint       0.88
viewpoints      0.83
Perspective     0.78
views           0.76
spection        0.74
lens            0.67
Lens            0.67
yip             0.67
Activations Density 0.021%