INDEX
Explanations
job interview, website analysis, readings
New Auto-Interp
Negative Logits
="
0.78
’
0.76
rumor
0.75
rumored
0.72
,
0.71
denominator
0.68
rumour
0.68
،
0.67
fashioned
0.67
entrenched
0.66
POSITIVE LOGITS
ER
0.93
Alguns
0.82
ке
0.73
ное
0.73
AN
0.72
ECT
0.71
आर
0.70
ви
0.69
Algun
0.69
นี
0.69
Activations Density 6.960%