INDEX
Explanations
references to specific named entities like people, organizations, and locations
Negative Logits
Ö¼           -0.52
È            -0.49
ĸ            -0.45
ļ            -0.45
istor        -0.45
*)           -0.44
··           -0.44
20439        -0.44
ioxide       -0.44
Examination  -0.44
Positive Logits
respectively  0.65
apiece        0.59
Pac           0.46
rael          0.43
built         0.41
chedel        0.41
Together      0.40
Hz            0.40
Fal           0.39
ijah          0.39
Activations Density: 2.468%