INDEX
Explanations
instances of communication or references to discussions within a forum or thread context
New Auto-Interp
Negative Logits
ipients
-0.15
-0.14
-0.14
auses
-0.14
åĩºçīĪ
-0.14
Tweet
-0.14
tweeting
-0.14
Wikimedia
-0.14
trie
-0.13
ieties
-0.13
POSITIVE LOGITS
thread
0.46
threads
0.44
Thread
0.42
forum
0.39
thread
0.38
/thread
0.36
-thread
0.36
posts
0.36
Thread
0.35
Threads
0.35
Activations Density 0.510%