INDEX
Explanations
modal verbs followed by be/verb
New Auto-Interp
Negative Logits
nightlife
1.27
terrific
1.21
patriotism
1.21
carnage
1.19
murky
1.18
rambling
1.16
television
1.16
dozens
1.14
spectacular
1.13
sexuality
1.13
POSITIVE LOGITS
be
1.19
denoted
0.99
Hence
0.99
create
0.97
not
0.96
تكون
0.95
dapat
0.95
mempunyai
0.95
يكون
0.94
could
0.92
Activations Density 0.420%