INDEX
Explanations
phrases containing the word "ours"
occurrences of a specific term or keyword repetitively throughout the text
New Auto-Interp
Negative Logits
Clarkson
-0.68
Metatron
-0.64
booth
-0.62
background
-0.59
EFF
-0.59
machine
-0.58
button
-0.57
xon
-0.57
Newton
-0.57
âĸł
-0.56
POSITIVE LOGITS
hip
1.22
urs
1.11
uits
1.09
uit
1.08
ury
1.06
geons
0.97
atile
0.97
ensical
0.95
cript
0.94
rences
0.93
Activations Density 0.007%