INDEX
Explanations
phrases containing the word "with."
New Auto-Interp
Negative Logits
purpoſe
-0.81
houſe
-0.79
ſte
-0.78
pleaſure
-0.77
poffible
-0.77
enfans
-0.75
faſt
-0.72
ſtate
-0.71
ſelf
-0.71
ſta
-0.71
POSITIVE LOGITS
with
1.14
WITH
0.99
with
0.98
With
0.95
With
0.94
WITH
0.91
avec
0.88
dengan
0.82
עם
0.82
dengan
0.81
Activations Density 0.399%