INDEX
Explanations
references to specific locations and actions associated with governance and environmental impact
Negative Logits
endoza      -0.16
enge        -0.15
TN          -0.15
乾          -0.15
rrha        -0.15
reur        -0.15
rdr         -0.15
unger       -0.15
iasi        -0.14
梨          -0.14
Positive Logits
Goa         0.38
Portuguese  0.30
Go          0.26
Portug      0.25
/go         0.23
Go          0.22
Portugal    0.22
go          0.21
GO          0.20
Curt        0.20
Activations Density 0.021%