INDEX
Explanations
pronouns that indicate possession
New Auto-Interp
Negative Logits
Immutable
-0.16
afi
-0.15
apus
-0.14
alie
-0.14
alist
-0.14
Stations
-0.14
ÑĪив
-0.14
áu
-0.14
egrator
-0.14
closets
-0.14
POSITIVE LOGITS
behalf
0.21
oose
0.17
esh
0.15
ogh
0.15
ission
0.15
-fly
0.14
ivic
0.14
ocz
0.14
spoiler
0.14
grounds
0.14
Activations Density 0.051%