INDEX
Explanations
questions or phrases that seek clarification or identification of subjects and facts
Negative Logits
continuant      -0.55
utnik           -0.53
teme            -0.52
pollici         -0.50
AddAttribute    -0.49
ין              -0.47
dientes         -0.46
DbType          -0.46
punkte          -0.46
liberi          -0.46
Positive Logits
what            0.82
WHAT            0.81
__(/*!          0.76
Hva             0.76
Hvad            0.74
WHAT            0.72
Hvem            0.71
saraba          0.70
what            0.70
Hvad            0.69
Activations Density: 0.121%