INDEX
Explanations
proper nouns related to companies or organizations
mentions of "Big" followed by numerical values, particularly those referring to groups or organizations
New Auto-Interp
Negative Logits
confir
-0.94
idency
-0.87
anwhile
-0.86
yrim
-0.82
Downloadha
-0.78
theless
-0.77
veyard
-0.73
guiActiveUn
-0.73
istry
-0.72
izabeth
-0.71
POSITIVE LOGITS
gest
1.29
ger
1.14
glers
0.87
ging
0.86
gers
0.85
gins
0.84
Brother
0.83
gie
0.83
Integer
0.81
Daddy
0.78
Activations Density 0.017%