INDEX
Explanations
drawing analogies with like/as
New Auto-Interp
Negative Logits
일까지
0.83
역시
0.79
మాత్రం
0.79
까지
0.77
حتی
0.77
even
0.77
Even
0.76
nawet
0.76
вовсе
0.75
even
0.73
POSITIVE LOGITS
headlights
0.83
upscale
0.81
highlighter
0.81
appetizers
0.79
speedometer
0.78
someone
0.77
somebody
0.77
quelqu
0.76
punya
0.76
catcher
0.75
Activations Density 0.482%