INDEX
Explanations
proper nouns related to a specific person or location, potentially related to news or events
specific names or references, particularly related to people or organizations
New Auto-Interp
Negative Logits
ãģ¦
-0.90
GEAR
-0.69
Yel
-0.69
exha
-0.69
TABLE
-0.69
ces
-0.67
antidepressants
-0.65
IGF
-0.65
resil
-0.65
unanim
-0.65
POSITIVE LOGITS
cht
1.01
mann
0.95
ronics
0.95
fried
0.88
itect
0.86
ung
0.86
geist
0.84
iness
0.83
idy
0.82
ijn
0.80
Activations Density 0.004%