INDEX
Explanations
phrases indicating a high level of confidence or suspicion
expressions of certainty or doubt
Negative Logits
Token         Logit
artney        -0.79
irlf          -0.76
cial          -0.73
idelines      -0.70
packs         -0.68
ife           -0.66
uit           -0.66
fund          -0.65
bonding       -0.65
isting        -0.64
Positive Logits
Token         Logit
poke          0.76
ļé            0.75
£ı            0.70
é¾į           0.69
Ͻ             0.69
Rasmussen     0.63
âĶĢâĶĢâĶĢâĶĢ  0.63
Īè            0.62
myself        0.62
admission     0.62
Activations Density: 0.132%