INDEX
Explanations
adverbs indicating generalizations or common truths
phrases that express generalizations or broadly applicable statements
Negative Logits
ÄŁ          -0.70
lyn         -0.69
Tycoon      -0.69
acity       -0.66
Mysteries   -0.65
illion      -0.65
Frenzy      -0.64
Orchestra   -0.64
ilion       -0.63
poke        -0.62
Positive Logits
speaking    1.33
regarded    1.00
frowned     0.89
accepted    0.88
speaking    0.84
construed   0.83
disliked    0.77
WAYS        0.75
considered  0.75
disappro    0.74
Activations Density: 0.045%