INDEX
Explanations
phrases related to breathing
New Auto-Interp
Negative Logits
↵↵
-0.50
↵
-0.48
-0.48
tersebut
-0.47
<strong>
-0.45
-0.42
//
-0.42
-0.40
<em>
-0.38
<h1>
-0.38
POSITIVE LOGITS
Anal
0.94
Anal
0.88
iſchen
0.87
Analytic
0.85
Analy
0.85
دانشنامهٔ
0.85
ſeinen
0.84
niſſe
0.83
ſelbſt
0.81
ſicht
0.81
Activations Density 0.915%