INDEX
Explanations
references to inclusion within a group or category
New Auto-Interp
Negative Logits
inter
-0.67
Reſ
-0.53
لينكات
-0.52
Eſ
-0.52
ISNI
-0.51
تقاوى
-0.51
Conſ
-0.51
ſte
-0.50
ſta
-0.50
متعلقه
-0.50
POSITIVE LOGITS
among
1.53
Among
1.51
Among
1.42
among
1.36
AMONG
1.27
parmi
1.16
amongst
1.05
blandt
1.04
Amongst
1.03
Parmi
0.97
Activations Density 0.066%