INDEX
Explanations
mentions of "Deep State."
New Auto-Interp
Negative Logits
فريبيس
-0.87
purpoſe
-0.85
greateſt
-0.84
myſelf
-0.83
ſame
-0.83
$_"
-0.81
occaf
-0.81
houſe
-0.80
itſelf
-0.79
neceffary
-0.79
POSITIVE LOGITS
deep
1.30
Deep
1.29
Deep
1.27
deep
1.09
profound
1.05
DEEP
0.95
DEEP
0.92
深
0.91
deeply
0.86
deepest
0.84
Activations Density 0.190%