INDEX
Explanations
instances of the word "int" and its variations, indicating a focus on intensity or measurement-related terms
Negative Logits
覚醒      -0.79
DT        -0.68
days      -0.67
phy       -0.66
phia      -0.65
brates    -0.65
pload     -0.65
millenn   -0.64
BLE       -0.64
76561     -0.64
Positive Logits
ypes        1.10
ellect      0.91
eki         0.88
aro         0.85
zman        0.83
opia        0.83
ured        0.82
ered        0.82
illation    0.81
essential   0.81
Activations Density 0.005%