Explanations
terms related to legal terminology and concepts
Tokens appearing next to mathematical symbols/variables
Negative Logits
Theſe      -1.56
myſelf     -1.50
Monfieur   -1.49
houſe      -1.43
pleaſure   -1.37
Efq        -1.36
ſelf       -1.36
Jefus      -1.33
Houſe      -1.31
purpoſe    -1.31
Positive Logits
T                0.72
(not rendered)   0.67
U                0.65
↵                0.64
C                0.64
di               0.63
l                0.62
W                0.62
<eos>            0.61
B                0.60
Activations Density 1.959%