INDEX
Explanations
phrases related to legal matters and government actions
references to organizations or treatments related to health issues
New Auto-Interp
Negative Logits
colour
-0.69
dstg
-0.67
iership
-0.63
Spoiler
-0.62
Divinity
-0.59
Laughs
-0.59
Grimoire
-0.58
::::::::
-0.57
righteousness
-0.57
Sorceress
-0.57
POSITIVE LOGITS
however
0.78
reportedly
0.78
cited
0.78
therefore
0.78
briefed
0.74
also
0.71
unsuccessfully
0.71
additionally
0.71
phased
0.71
estimated
0.70
Activations Density 1.870%