INDEX
Explanations
questions and inquiries within the text
Negative Logits
Token         Logit
entanto       -0.72
########.     -0.64
Quindi        -0.64
anmoins       -0.63
correctes     -0.60
oa̍t           -0.59
لكن           -0.58
Namun         -0.55
Namun         -0.55
endforeach    -0.55
Positive Logits
Token         Logit
answer        1.37
answers       1.29
answer        1.21
Answer        1.20
Answer        1.14
答案          1.12
answered      1.08
Answers       1.04
answers       1.03
Answers       0.99
Activations Density 0.180%