Explanations
References to questioning or inquiry, particularly the word "who", including its equivalents in other languages (for example "hvem", "quién", "Siapa").
Negative Logits
Token         Logit
ある          -0.55
former        -0.54
second        -0.54
chera         -0.52
rakech        -0.52
CRP           -0.51
lans          -0.51
lej           -0.51
circo         -0.51
particular    -0.49
Positive Logits
Token         Logit
Who           2.07
Who           2.06
who           1.85
who           1.84
WHO           1.72
hvem          1.70
WHO           1.68
quién         1.54
quién         1.49
Siapa         1.49
Activations Density: 0.078%