INDEX
Explanations
phrases or sentences where there is confusion or a mix-up in information
instances of confusion or misunderstanding in various contexts
Negative Logits
Token      Logit
gone       -0.70
apons      -0.70
bors       -0.69
©¶æ        -0.67
metic      -0.66
helicop    -0.65
odder      -0.64
ternity    -0.64
eatures    -0.64
grad       -0.64
Positive Logits
Token      Logit
ingly      0.94
Leilan     0.91
Beir       0.80
ively      0.78
confuse    0.76
INESS      0.74
ly         0.73
ilde       0.71
Nunes      0.69
uously     0.68
Activations Density: 0.024%