INDEX
Explanations
comparisons between different entities or concepts
phrases discussing differences in context or comparison
New Auto-Interp
Negative Logits
tti
-0.73
ilyn
-0.72
ccording
-0.72
everal
-0.68
PDATE
-0.66
bars
-0.65
Corn
-0.65
wind
-0.65
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
-0.64
lynn
-0.63
POSITIVE LOGITS
afar
0.99
ours
0.98
theirs
0.85
ordinary
0.76
scrimmage
0.75
hers
0.74
whence
0.74
anywhere
0.70
what
0.70
parap
0.66
Activations Density 0.068%