Explanations
references to interactions or communications involving the speaker
references to personal communication or interactions
Negative Logits
¿½                                  -0.68
rawdownloadcloneembedreportprint    -0.63
Execution                           -0.63
ESV                                 -0.62
profits                             -0.62
ģĸ                                  -0.61
lance                               -0.59
Wikimedia                           -0.59
vi                                  -0.58
cing                                -0.57
Positive Logits
symp           0.75
asking         0.73
bluff          0.73
backstage      0.73
verbally       0.72
politely       0.72
afterward      0.71
privately      0.71
questioning    0.71
orally         0.71
Activations Density 0.139%