INDEX
Explanations
phrases indicating quantities or assessments related to significance or importance
"a" followed by a descriptive adjective
amount or quantity
New Auto-Interp
Negative Logits
ModelAdmin
-0.68
صوتيه
-0.59
instructions
-0.58
المعيارى
-0.57
instruction
-0.57
negroes
-0.57
savages
-0.56
บัติ
-0.56
petals
-0.55
يتيمه
-0.54
POSITIVE LOGITS
variety
0.66
Vielzahl
0.63
myriad
0.63
multitude
0.59
greater
0.59
plethora
0.58
range
0.56
PerformLayout
0.56
broad
0.55
sense
0.55
Activations Density 0.634%