INDEX
Explanations
questions and statements in a text
New Auto-Interp
Negative Logits
ufact
-0.81
photos
-0.71
olia
-0.67
robe
-0.66
ilon
-0.65
natureconservancy
-0.63
Lago
-0.62
ool
-0.61
Sigma
-0.61
Weapons
-0.61
POSITIVE LOGITS
answered
1.57
answered
1.48
unanswered
1.45
answer
1.44
answering
1.29
Answer
1.29
answers
1.28
answ
1.26
asked
1.19
Answers
1.13
Activations Density 0.190%