INDEX
Explanations
adjectives describing quantity or size
terms related to large-scale events or issues
New Auto-Interp
Negative Logits
Ïī
-0.84
present
-0.70
Downloadha
-0.70
ãĤ¼
-0.69
åĤ
-0.68
ãģ¾
-0.66
iphate
-0.65
thereof
-0.65
slot
-0.65
AX
-0.64
POSITIVE LOGITS
Examples
1.00
Thoughts
0.93
ly
0.93
Problems
0.90
Locations
0.90
Techniques
0.86
Categories
0.86
Features
0.84
Names
0.84
Stories
0.83
Activations Density 0.221%