INDEX
Explanations
specific formats or patterns within the document text
special characters and formatting elements in the text
New Auto-Interp
Negative Logits
iller
-0.70
fish
-0.67
Ridley
-0.67
divers
-0.67
nery
-0.64
illary
-0.64
Eye
-0.63
illed
-0.63
Wiggins
-0.63
Kitt
-0.63
POSITIVE LOGITS
------------
1.17
--------------
1.12
---------------
1.12
-------------
1.09
==
1.08
====
1.07
---
1.07
===
1.06
---------
1.03
--------------------------------------------------------
1.02
Activations Density 0.024%