INDEX
Explanations
dates written in a specific format - "âĢĵ" followed by numbers representing years
the occurrence of a specific character or symbol sequence in the text
New Auto-Interp
Negative Logits
iple
-0.68
ysis
-0.67
Borg
-0.64
proced
-0.64
rooting
-0.62
redress
-0.61
Blu
-0.60
ppe
-0.60
QC
-0.59
Starr
-0.59
POSITIVE LOGITS
20439
0.97
Pg
0.92
advertisement
0.91
_>
0.89
––
0.88
女
0.87
coll
0.86
Lie
0.85
PRES
0.83
advertising
0.83
Activations Density 0.040%