INDEX
Explanations
references to popular culture and social commentary
New Auto-Interp
Negative Logits
Whilst
-1.14
Whilst
-1.14
utilising
-0.93
poichè
-0.91
whilst
-0.88
utilise
-0.83
favourable
-0.80
endeavour
-0.79
endeavours
-0.79
hésite
-0.78
POSITIVE LOGITS
pols
0.77
ain
0.73
bof
0.67
herewith
0.66
lousy
0.66
darn
0.66
…)
0.66
gee
0.65
...)
0.64
sez
0.63
Activations Density 0.901%