INDEX
Explanations
concepts related to social issues and personal values
New Auto-Interp
Negative Logits
apan
-0.48
enumi
-0.45
Ni
-0.44
Nicole
-0.43
Met
-0.42
Ge
-0.42
Me
-0.42
ayangkan
-0.42
Pan
-0.41
Dr
-0.40
POSITIVE LOGITS
OGND
0.54
UnitTesting
0.50
jspb
0.50
autorytatywna
0.50
ujednoznacz
0.48
Personendaten
0.48
=*/
0.46
Diweddarwch
0.45
]")]
0.44
pinulongan
0.43
Activations Density 0.259%