INDEX
Explanations
verbs or phrases related to making appeals or appeals being made
phrases expressing various forms of appeals or requests
New Auto-Interp
Negative Logits
sterdam
-0.93
llan
-0.76
isk
-0.72
alde
-0.69
auga
-0.67
stadt
-0.67
hered
-0.64
kes
-0.62
holm
-0.62
grad
-0.62
POSITIVE LOGITS
plea
0.77
gauge
0.72
ĸļ
0.70
refrain
0.70
plead
0.69
Revival
0.68
gull
0.68
pleas
0.67
reconsider
0.67
appease
0.65
Activations Density 0.125%