INDEX
Explanations
differences or contrasts between two entities or concepts
repetitive structures and patterns in sentence construction
New Auto-Interp
Negative Logits
isure
-0.63
showc
-0.62
lift
-0.62
rox
-0.60
ãĤ¼ãĤ¦ãĤ¹
-0.59
eruption
-0.59
ain
-0.59
Eva
-0.59
bucket
-0.58
¬¼
-0.58
POSITIVE LOGITS
however
0.81
ours
0.68
though
0.67
000
0.64
Seym
0.64
there
0.64
unin
0.64
whose
0.63
utherford
0.63
devices
0.62
Activations Density 0.098%