INDEX
Explanations
sentence structures that establish definitions or descriptions
New Auto-Interp
Negative Logits
ThroughAttribute
-1.00
bootstrapcdn
-0.97
клопе
-0.91
Мексичка
-0.90
gainera
-0.90
Meksiku
-0.89
fevere
-0.89
myſelf
-0.87
ویکیپدیا
-0.85
OGND
-0.85
POSITIVE LOGITS
a
0.79
an
0.77
designed
0.67
based
0.66
is
0.65
intended
0.59
aimed
0.57
0.54
0.52
part
0.52
Activations Density 0.326%