INDEX
Explanations
scientific terms or concepts
references to various phenomena or observed events
New Auto-Interp
Negative Logits
gur
-0.96
zar
-0.77
oÄŁ
-0.75
umenthal
-0.72
ãĤĮ
-0.72
zik
-0.72
ramid
-0.71
bitious
-0.69
endars
-0.67
pees
-0.67
POSITIVE LOGITS
phenomenon
0.91
ively
0.89
ually
0.88
whereby
0.88
occurring
0.86
ional
0.86
phenomena
0.80
ĸļ
0.79
icity
0.77
occurrence
0.77
Activations Density 0.040%