INDEX
Explanations
references to people or entities involved in a situation or event
Negative Logits
Sink       -0.21
sink       -0.20
sinks      -0.19
sinking    -0.17
ares       -0.16
478        -0.15
Sink       -0.15
plat       -0.15
raki       -0.15
ulp        -0.15
Positive Logits
SOLE       0.15
.Style     0.15
ENU        0.15
aż         0.14
Brush      0.14
ptrdiff    0.14
Brush      0.14
brush      0.14
enty       0.14
AMED       0.14
Activations Density 0.012%