INDEX
Explanations
references to particular sections in a text structure
references to ongoing stories or narratives in a document
New Auto-Interp
Negative Logits
»Ĵ
-0.74
ŃĶ
-0.63
fame
-0.62
ibilities
-0.61
«ĺ
-0.60
abstinence
-0.59
eco
-0.58
Ĥ¬
-0.57
uits
-0.57
DEM
-0.56
POSITIVE LOGITS
Continued
0.86
tenance
0.77
caption
0.71
hart
0.70
telling
0.68
Trend
0.63
"></
0.62
advertisement
0.62
biz
0.62
line
0.62
Activations Density 0.008%