INDEX
Explanations
phrases indicating contrast or contradiction
negation and phrases that express conditional relationships
New Auto-Interp
Negative Logits
chnology
-0.57
士
-0.55
²¾
-0.55
auri
-0.54
available
-0.54
iHUD
-0.52
cms
-0.52
ricks
-0.51
cream
-0.51
gil
-0.51
POSITIVE LOGITS
Xander
0.53
THR
0.51
apologies
0.49
ãĤĵ
0.46
Woodward
0.45
Miliband
0.45
Opinion
0.44
Ender
0.44
congr
0.43
pardon
0.43
Activations Density 1.114%