INDEX
Explanations
phrases related to comparison or different aspects of something
expressions of multiple interpretations or perspectives on a topic
New Auto-Interp
Negative Logits
bart
-0.66
tailed
-0.63
andel
-0.62
alsa
-0.61
ctors
-0.61
perty
-0.61
prus
-0.60
lish
-0.60
multiple
-0.59
mins
-0.59
POSITIVE LOGITS
resembles
0.75
analogous
0.72
resembling
0.71
,
0.68
reminiscent
0.68
resemble
0.67
mirror
0.64
embodies
0.64
it
0.63
this
0.61
Activations Density 0.088%