INDEX
Explanations
possessive pronouns and articles followed by nouns
New Auto-Interp
Negative Logits
بشكل
1.02
dysfunctional
0.91
blurry
0.87
arguably
0.86
iteratively
0.84
standalone
0.83
dystopian
0.82
minimalist
0.81
overseen
0.81
neoliberal
0.80
POSITIVE LOGITS
splendid
0.72
kindred
0.69
exig
0.68
great
0.67
countenance
0.64
wretched
0.63
সহিত
0.63
heathen
0.62
わざ
0.62
outlay
0.60
Activations Density 0.049%