INDEX
Explanations
terms related to environmental impact and pollution reduction
Negative Logits
(blank)  -0.62
...  -0.59
<<<<<<<<<<<<<<  -0.56
-  -0.56
للمعارف  -0.55
šinou  -0.51
ண்டு  -0.51
okuyayım  -0.49
–  -0.48
AccessorTable  -0.48
Positive Logits
XNUMX  1.11
itſelf  0.94
myſelf  0.88
whoſe  0.88
houſe  0.88
NUMX  0.87
ſelf  0.87
leſs  0.82
becauſe  0.82
་་  0.81
Activations Density 0.032%