INDEX
Explanations
comparisons between two entities, especially highlighting the second mentioned entity
the term "latter" in various contexts indicating comparisons or distinctions
New Auto-Interp
Negative Logits
Clover
-0.70
9999
-0.70
cars
-0.69
Cars
-0.68
arij
-0.68
Cola
-0.68
Magn
-0.67
grid
-0.67
oku
-0.64
hers
-0.63
POSITIVE LOGITS
mentioned
0.91
stages
0.85
ingly
0.73
nder
0.73
worldly
0.69
most
0.69
part
0.68
dilig
0.68
type
0.66
sort
0.66
Activations Density 0.034%