INDEX
Explanations
phrases that indicate opinions or advice about improving situations or making choices
"it" followed by a descriptive adjective
Negative Logits
abstractmethod    -0.48
plazos            -0.47
calcetines        -0.44
abbandon          -0.40
parcial           -0.38
prazo             -0.37
contenedores      -0.36
profesores        -0.35
agujas            -0.35
gobern            -0.34
Positive Logits
ujednoznacz       0.75
⟬                 0.75
become            0.69
becoming          0.66
Vidite            0.65
Becoming          0.65
make              0.65
Becoming          0.64
becomes           0.62
__((              0.60
Activations Density 0.111%
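
The density figure above is not defined on this page. A minimal sketch of one common reading follows, assuming it means the fraction of token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires,
    not an average activation strength.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, with the feature firing on one of them.
sample = [0.0] * 999 + [2.5]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.111% would correspond to the feature firing on roughly 1 in 900 token positions.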