INDEX
Explanations
phrases indicating possession or requests for information
New Auto-Interp
Negative Logits
ogn
-0.15
ano
-0.14
uced
-0.14
omm
-0.14
sher
-0.14
Ïĩν
-0.14
isk
-0.14
оÑĤказ
-0.13
demise
-0.13
atis
-0.13
POSITIVE LOGITS
questions
0.32
Questions
0.26
ever
0.25
questions
0.24
any
0.23
Questions
0.21
experience
0.20
Fragen
0.19
trouble
0.19
concerns
0.19
Activations Density 0.088%