Explanations
instances of the word "fer" in various contexts and capitalizations
references to official or government-related topics
Negative Logits
Decay          -0.72
ħĭ             -0.61
Bang           -0.60
Improvement    -0.59
havoc          -0.59
Products       -0.59
VP             -0.58
krit           -0.57
hift           -0.57
invent         -0.57
Positive Logits
rer            1.12
ior            1.05
ring           1.04
dinand         1.01
ocious         1.00
andom          0.99
rers           0.96
rets           0.91
ministic       0.91
rett           0.86
Activations Density: 0.011%