INDEX
Explanations
statements or quotations in text
dialogue or statements made by individuals
Negative Logits
Vaugh        -0.70
ãĥ¼ãĥĨ       -0.63
Redd         -0.62
amac         -0.60
condem       -0.60
egu          -0.60
disadvant    -0.58
Mobil        -0.57
advoc        -0.57
lapt         -0.57

Positive Logits
âĢº          0.55
guiActive    0.49
â            0.49
âľ           0.48
crochet      0.46
ye           0.46
ages         0.45
vernment     0.45
·            0.45
TRI          0.45
Activations Density 0.452%
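
The density figure above is not defined on this page. A common reading is the share of token positions on which the feature fires at all. The sketch below illustrates that reading only: the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active; the dashboard itself does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: 1000 token positions, 5 of which activate.
toy = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
density = activation_density(toy)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.452% would mean the feature is active on roughly 4 to 5 of every 1000 token positions sampled.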