INDEX
Explanations
mentions of legal and political actions or decisions
instances of numerical values or ratings
Negative Logits
Ĥ         -0.90
©         -0.85
poon      -0.78
¡         -0.73
¥µ        -0.72
¬         -0.70
baugh     -0.69
Ĥ¬        -0.68
Ĭ         -0.67
ī         -0.67
Positive Logits
FILE      1.22
Fresh     0.76
Returns   0.71
³³³³³³³³  0.70
Actor     0.69
Shell     0.69
IJ        0.69
âĵĺ       0.68
Credit    0.67
article   0.67
Activations Density 0.076%
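
As a rough illustration of how a density figure like the one above could be computed, the sketch below assumes that "activations density" means the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; none of them come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density is counted over all token positions in the sample,
    and a feature "fires" when its activation is strictly above threshold.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: one feature's activations over 100,000 token positions,
# firing on 76 of them (every 1,000th position from 0 to 75,000).
sample = [0.0] * 100_000
for i in range(0, 76_000, 1_000):
    sample[i] = 1.5

density = activation_density(sample)
print(f"Activation density: {density:.3%}")
```

A sparse feature of this kind fires on fewer than one token in a thousand, so the reported percentage depends heavily on the size and makeup of the token sample.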