INDEX
Explanations
references to opinions, feelings, and subjective assessments
that followed by specific words
New Auto-Interp
Negative Logits
RenderAtEndOf
-0.61
complexContent
-0.61
ftagPool
-0.53
➌
-0.52
FetchType
-0.51
EndContext
-0.50
CURIAM
-0.50
Rptr
-0.50
ویکیپدی
-0.49
Географиясе
-0.49
POSITIVE LOGITS
berdayakan
0.44
ambilan
0.41
orejas
0.40
conmigo
0.40
jarkan
0.39
rektur
0.38
vengan
0.38
ticias
0.38
وردار
0.38
desnuda
0.38
Activations Density 0.063%