INDEX
Explanations
references to specific individuals or groups in contexts of trust and opinion
tolerated condition
high-frequency function tokens and punctuation that mark clause boundaries, possession/attribution, and discourse transitions in sentences.
New Auto-Interp
Negative Logits
Hentet
-0.49
itake
-0.46
ftagPool
-0.45
awtextra
-0.44
parsedMessage
-0.43
contentLoaded
-0.43
مشارکتکنندگان
-0.42
gardens
-0.42
holi
-0.42
Qaraldi
-0.41
POSITIVE LOGITS
المعيارى
0.51
perverse
0.46
Toler
0.43
Toler
0.40
లాలు
0.38
len
0.37
tolerated
0.35
nestjs
0.35
compromised
0.35
ophiles
0.35
Activations Density 0.220%