INDEX
Explanations
notes or annotations within text
instances of the word "Note" or variations of it
New Auto-Interp
Negative Logits
gut
-0.73
quit
-0.65
hust
-0.64
tactics
-0.63
volunteers
-0.63
pal
-0.62
fight
-0.61
open
-0.61
ulic
-0.61
adian
-0.60
POSITIVE LOGITS
Note
3.48
Note
2.33
note
2.29
NOTE
2.28
Notes
1.94
Notice
1.86
NOTE
1.83
note
1.64
Important
1.53
Warning
1.47
Activations Density 0.018%