INDEX
Negative Logits
Though        0.71
Whilst        0.61
amongst       0.60
whilst        0.59
Whilst        0.59
though        0.57
Additionally  0.57
Lastly        0.54
Utilizing     0.54
Though        0.54
Positive Logits
Presumably    0.66
bureaucrats   0.65
putative      0.63
reformers     0.62
policymakers  0.58
presumably    0.56
inevitably    0.55
bureaucr      0.54
insofar       0.52
hapless       0.50
Activations Density 0.008%