INDEX
Explanations
questions being asked in text
questions and statements that begin with "Is," indicating inquiries or examinations of a topic
Negative Logits
ãģĮ       -0.77
è¦ļéĨĴ    -0.72
ð         -0.64
excess    -0.61
ESE       -0.61
lands     -0.61
hoop      -0.60
rites     -0.60
depot     -0.60
IOR       -0.60
Positive Logits
olated    1.35
olation   1.25
olate     1.23
abella    1.15
htar      1.08
ync       1.01
sei       1.00
rael      1.00
peria     0.98
terness   0.95
Activations Density 0.085%