Explanations
ASCII art patterns and formats
symbols and special characters in the text
Negative Logits
agre              -0.66
princ             -0.63
anwhile           -0.59
conservancy       -0.58
psychiat          -0.57
occas             -0.56
oulos             -0.56
conclud           -0.56
confir            -0.56
undermin          -0.53
Positive Logits
É                  0.68
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯   0.68
||                 0.60
-|                 0.59
â̦â̦                 0.59
__                 0.57
~~~~               0.57
___                0.56
istg               0.56
||                 0.55
Activations Density 0.433%