INDEX
Explanations
the word "fictional" and related variations
words related to different types of institutions and their functions
Negative Logits
uden              -0.75
================  -0.73
Blog              -0.72
KO                -0.70
RGB               -0.68
Blog              -0.67
ARDS              -0.65
使                -0.64
ALT               -0.64
gone              -0.62
Positive Logits
ional             1.09
ities             1.01
izational         0.93
ism               0.91
ization           0.90
ized              0.88
isations          0.86
ised              0.86
acia              0.85
iott              0.85
Activations Density 0.010%