INDEX
Explanations
phrases describing beliefs, understandings, or statements about individuals or situations
statements regarding assertions or beliefs about individuals or events
New Auto-Interp
Negative Logits
Submission
-0.61
heid
-0.60
Interstitial
-0.59
comes
-0.59
ajo
-0.59
ment
-0.58
Cancer
-0.58
ettlement
-0.57
avier
-0.56
monkeys
-0.56
POSITIVE LOGITS
ª
0.79
proport
0.75
ocry
0.69
anecd
0.66
¯
0.66
¶
0.64
¬
0.64
distingu
0.63
acknowled
0.63
ij士
0.63
Activations Density 0.105%