INDEX
Explanations
phrases and terms that indicate opinions or perceptions about quality or experiences
New Auto-Interp
Negative Logits
CloseHandle
-0.62
noDo
-0.62
tovers
-0.51
PUBLISHED
-0.51
hemiah
-0.50
ongi
-0.49
orus
-0.48
зулта
-0.48
Diweddarwch
-0.48
GetEnumerator
-0.48
POSITIVE LOGITS
rumored
0.82
rumor
0.78
rumors
0.71
rumour
0.68
RegressionTest
0.64
Infórmanos
0.62
ImageContext
0.61
rumoured
0.60
allegedly
0.58
supposedly
0.58
Activations Density 0.089%