INDEX
Explanations
mentions of dramatic or intriguing developments or situations
New Auto-Interp
Negative Logits
velt
-0.73
amiya
-0.64
Britann
-0.64
Examiner
-0.58
heid
-0.57
iege
-0.57
Costume
-0.56
hypothesis
-0.54
isse
-0.54
orney
-0.53
POSITIVE LOGITS
rid
1.02
cloneembedreportprint
0.88
progressively
0.81
underway
0.75
tin
0.74
TING
0.71
aways
0.71
bog
0.70
louder
0.70
traction
0.70
Activations Density 6.705%