INDEX
Explanations
statements and questions reflecting correctness and understanding
assertions or statements where someone claims to be correct or right about something.
New Auto-Interp
Negative Logits
positories
-0.35
gea
-0.33
circ
-0.30
casus
-0.30
AxisAlignment
-0.29
kär
-0.29
Asbury
-0.29
trauma
-0.28
ComVisible
-0.28
inkább
-0.28
POSITIVE LOGITS
correct
2.63
wrong
2.41
Correct
2.39
Correct
2.39
correct
2.36
incorrect
2.22
CORRECT
2.17
wrong
2.09
Wrong
2.08
WRONG
2.00
Activations Density 0.729%