zmq)

Jul 21, 2021

https://github.com/bitcoin/bitcoin/pull/22383

Host: jnewbery - PR author: jlopp

The PR branch HEAD was 78f4c8b98e at the time of this review club meeting.

Notes

Historic blocks that make up the block chain are stored on disk in blk?????.dat files. Those files each contain multiple serialized blocks, which consist of a header followed by the transactions contained in that block.
The transaction index is an optional index from transaction id (txid) to where that transaction is stored in the block files (file and offset), for all transactions that have been confirmed in a block. For normal operation of the node, a txindex is not required, and by default the txindex is disabled.
The getrawtransaction RPC is used to fetch a transaction from the node. When the verbose argument is set to false, it returns the hex-encoded serialized transaction. When verbose is set to true, it returns a JSON object containing information about the transaction.
getrawtransaction can retrieve the transaction in one of the following ways:
- mempool: unconfirmed transactions can be looked up by txid in the mempool.
- by block hash: if the user provides the hash of the block that the transaction is contained in, the block (including all transactions) is deserialized from disk, and the txid of each of the deserialized transactions is compared with the requested txid. If a matching transaction is found, it is returned to the user.
- using the txindex: if the node has txindex enabled, then the txid is looked up in the transaction index to find the location of the transaction on disk. The transaction is then deserialized from that location and returned to the user.
If the user provides the wrong block hash, then the call to getrawtransaction will fail.
If txindex is enabled and the correct block hash is provided, then performance will be much slower than if no block hash has been provided. This PR attempts to fix that so that there is no performance penalty for providing the block hash.

Questions

Did you review the PR? Concept ACK, approach ACK, tested ACK, or NACK?
Without looking at the code, why do you think that performance is worse when providing a block hash (assuming txindex is enabled)?
If we’re looking up the transaction by block hash, then what are the steps required? How do we know where to find that block on disk? How much data is deserialized?
If we’re looking up the transaction using the txindex, how much data is deserialized?
The first version of this PR included a behaviour change: when an incorrect block_index is provided to GetTransaction, but g_txindex->FindTx(hash, hashBlock, tx) finds and returns the tx. After this PR we would return the tx although it isn’t in the block the user asked for.

This behaviour change was removed in a subsequent push to the PR. Do you think the behaviour change is an improvement? Should it be included in this PR?
How can this PR be tested? Are any new test cases required?

Meeting Log

17:00

<jnewbery> #startmeeting

17:00

<willcl_ark> hi

17:00

<stickies-v> hi!

17:00

<larryruane> hi

17:00

<theStack> hi

17:00

<jnewbery> Hi folks! Welcome to Bitcoin Core PR Review Club

17:00

<raj_> hi

17:00

<emzy> hi

17:00

<dariusp> hi!

17:00

<glozow> hi!

17:01

<jnewbery> The review club is a place for people to come and learn about the process of contributing to Bitcoin Core. Everyone is welcome to come and ask questions

17:01

<jnewbery> Feel free to say hi to let people know you're here

17:01

<jnewbery> Anyone here for the first time?

17:01

<lopp> First time participant, reporting for duty!

17:01

<Sachin> hi

17:01

<jnewbery> lopp: welcome!!

17:02

<Azorcode> Hello everyone

17:02

<jnewbery> Notes and questions are here: https://bitcoincore.reviews/22383. I'll use the questions as prompts to guide the conversation, but feel free to jump in at any point if you have any questions or points you want to add

17:02

<sanket1729_> Hi

17:03

<jnewbery> Who had a chance to review the PR this week? (y/n)

17:03

<raj_> y

17:03

<unplanted> hi

17:03

<larryruane> y

17:03

<stickies-v> y

17:03

<neha> hi

17:03

<glozow> 0.5y

17:03

<theStack> y

17:03

<willcl_ark> y

17:03

<pglazman> y

17:03

<unplanted> n

17:03

<dariusp> n

17:03

<emzy> 0,5y

17:04

<jnewbery> Did you review the PR? Concept ACK, approach ACK, tested ACK, or NACK? And for those that reviewed the PR, can you briefly summarise what it's doing?

17:04

<Murch[m]> N

17:04

<Murch[m]> Well sort of

17:05

<raj_> Using txindex if enabled irrespective of weather blockhash is provided or not. But return failure provided blockhash dont match with found blockhash.

17:05

<larryruane> tested ACK -- summary: if the transaction index is enabled (`-txindex`), then use that to loop up a transaction, rather than reading a block from disk

17:06

<theStack> code-review ACK

17:06

<jnewbery> raj_ larryruane: correct!

17:06

<stickies-v> Approach ACK - it resolves unexpected performance loss where (when having a txindex) GetTransaction becomes slower when providing the optional block_index parameter

17:06

<jnewbery> stickies-v: correct

17:06

<jnewbery> 2. Without looking at the code, why do you think that performance is worse when providing a block hash (assuming txindex is enabled)?

17:07

<stickies-v> because the entire block instead of just the transaction gets deserialized

17:07

<raj_> De-serialization of the entire block takes more time than finding via txindex?

17:07

<jnewbery> (and if you've reviewed the PR, you can ignore the part about not looking at the code)

17:07

<theStack> the txindex is not used in this case, and the slow path of deserializing the whole block from disk is taken

17:08

<jnewbery> stickies-v raj_: I think you're right. Reading the block from disk and deserializing is probably what's causing the delay.

17:08

<jnewbery> theStack: right

17:08

<larryruane> also, is it likely that the parts of the txindex we need are already in memory (so no disk read needed)? (But the deserialization is the most important thing)

17:08

<unplanted> jnewbery would it be appropriate/relevant to visualize the slowdown with a fireplot?

17:08

<jnewbery> Did anyone do any profiling to see how long deserialization of a block takes?

17:09

<stickies-v> didn't run any code, not on my dev machine...

17:10

<sipa> jnewbery: that's also what i assume

17:10

<jnewbery> larryruane: the txindex points to the location of the transaction on the disk, so a disk read is always required for that part. You may be right about the txindex itself being in memory. I'm not familiar with the txindex code and how often it gets flushed or if there's any caching

17:11

<murch> I mean, don't you also need to linearly search through the whole transaction set of the block if you're looking it up from the block?

17:11

<jnewbery> next q: If we’re looking up the transaction by block hash, then what are the steps required? How do we know where to find that block on disk? How much data is deserialized?

17:12

<larryruane> jnewbery: the block index (also leveldb, like txindex) stores the blk file number and byte offset of the start of the block on disk

17:12

<larryruane> it's indexed by block hash

17:13

<jnewbery> larryruane: I would assume that it's indexed by txid if it's a transaction index

17:13

<larryruane> murch: great point, may have to scan over a thousand transactions (on average) to find it

17:13

<sipa> the txindex stores the file number, begin offset of that block, and begin offset of that tx

17:13

<sipa> iirc

17:13

<larryruane> jnewbery: no the block index

17:13

<unplanted> when I said fireplot I meant flame graphs ':]

17:14

<jnewbery> larryruane: the txindex is an index from txid -> (file, block pos, tx offset) , no?

17:15

<jnewbery> so you can look things up by txid

17:15

<theStack> murch: i guess once the block is deserialied into memory searching a tx doesn't take that long (in comparison), considering the limited number of txs in a block

17:15

<larryruane> jnewbery: yes, I thought you were asking about having the block hash but not the txindex

17:16

<jnewbery> ooooh sorry I misread your earlier message!

17:17

<jnewbery> yes, you're right about the block index being an index on the block hash

17:17

<larryruane> sipa: so the txindex allows you to seek directly to the tx on disk, do a small read (relative to an entire block), deserialize only that one tx .. so that's why it's much faster

17:17

<sipa> right

17:17

<jnewbery> back to the question: what are the steps currently when looking up a transaction with a provided block hash?

17:18

<jnewbery> it's kind of been answered already

17:18

<jnewbery> we deserialize the entire block, and then scan through each transaction looking for a match: https://github.com/bitcoin/bitcoin/blob/a3791da0e80ab35e862989373f033e5be4dff26b/src/validation.cpp#L1163-L1170

17:19

<murch> Well, you find the block, deserialize it and then look through the transactions until you find the one in question

17:19

<jnewbery> murch: exactly right

17:19

<glozow> get block from disk, deserialize it, deserialize all the transactions and compare their txids with the one requested until we find it

17:19

<larryruane> glozow: I think deserializing the block implies deserializing all the transaction it contains (right?)

17:19

<raj_> Find the block, find the transaction, see if txid matches, get back the blockhash and tx.

17:19

<glozow> larryruane: ya sorree

17:19

<jnewbery> theoretically you could stop deserializing part way through if you found a tx with the right txid, but we don't have a way of doing that

17:20

<unplanted> jnewbery we don't have a method to do that, or we haven't implemented one?

17:20

<jnewbery> but yes, it involves deserializing 1-2MB on average from disk into CBlock and CTransaction objects

17:21

<jnewbery> unplanted: there's no method to do that. I'm not suggesting that it'd be a good idea, but it's at least theoretically possible

17:22

<jnewbery> but if searching for transactions in serialized blocks was a performance critical operation, then stopping as soon as you deserialized the right transaction would be a reasonable operation

17:22

<jnewbery> *reasonable optimization

17:22

<jnewbery> ok, next question: If we’re looking up the transaction using the txindex, how much data is deserialized?

17:23

<murch> Since we know the offset of the tx data in the block, only the transaction itself

17:23

<raj_> The block header and the transaction only?

17:23

<jnewbery> murch: right

17:23

<larryruane> murch: i agree

17:23

<jnewbery> raj_: I don't think we even need to deserialize the block header, do we?

17:24

<murch> I think larryruane said earlier that the txindex stored the offset of the block in the block file and the offset of the tx respective to that, so all credit to him :D

17:24

<larryruane> jnewbery: I think that's correct -- with txindex, you don't even know which block the tx belongs to

17:24

<unplanted> jnewbery txindex is providing the offsets, so no I think

17:24

<larryruane> murch: no i was referring to the block index

17:24

<murch> Oh, well then I just guessed well

17:24

<larryruane> murch: oh it does?? i was accidentally right! haha

100

17:25

<murch> So would the txindex have the offset per blockfile?

101

17:25

<sipa> yes, it has the offset within the blk*.dat file where the tx starts

102

17:25

<raj_> jnewbery, yes but it seems its still reading the header and then seeking to the tx offset (or atleast thats how I am reading the code, so might be wrong)

103

17:25

<sipa> (and also the offset within that file where the block it contains starts)

104

17:26

<murch> So, it knows where the corresponding block starts, but does not explicitly store which block that is?

105

17:26

<jnewbery> raj_: I take it back. You're right!

106

17:26

<jnewbery> https://github.com/bitcoin/bitcoin/blob/a3791da0e80ab35e862989373f033e5be4dff26b/src/index/txindex.cpp#L242-L251

107

17:27

<Sachin> ```if (g_txindex->FindTx(hash, block_hash, tx)) {

108

17:27

<Sachin> if (!block_index || block_index->GetBlockHash() == block_hash) {

109

17:27

<Sachin> hashBlock = block_hash;

110

17:27

<Sachin> return tx;

111

17:27

<Sachin> }

112

17:27

<Sachin> }

113

17:27

<Sachin> does this code not return the blockhash of the tx?

114

17:27

<Sachin> or bind it to a pointer that is available?

115

17:27

<theStack> ah, TxIndex::FindTx() uses the header to find out and set the block hash

116

17:27

<jnewbery> Sachin: please don't paste code into the chat. You can paste a link to the code on github

117

17:28

<Sachin> my bad, thank you

118

17:28

<jnewbery> no problem! It just gets a bit noisy if we paste code in here

119

17:28

<raj_> Sachin, I think the hash is returned in that `hashBlock` variable.

120

17:29

<larryruane> TIL after it deserializes the header (`file >> header`), the file offset is immediately _after_ the header, so that's a relative seek to the transaction offset .. cool

121

17:29

<jnewbery> Right, FindTx() returns the block hash, which it gets from deserializing the header and hashing it

122

17:29

<raj_> jnewbery, is the reason that it still reads the block header is that tx offset starts counting after the header?

123

17:29

<Sachin> yeah, that's what I thought but I wanted to confirm. So the user can still determine the blockhash after this call

124

17:30

<glozow> right: https://github.com/bitcoin/bitcoin/blob/54e31742d208eb98ce706aaa6bbd4b023f42c3a5/src/index/txindex.cpp#L255

125

17:30

<sipa> murch: correct, i think (re: "does not explicitly store which block that is")

126

17:30

<murch> sipa: thanks

127

17:31

<jnewbery> raj_: the reason is that it needs to calculate the block hash from the block header. If it didn't need to do that, the index could just store the offset of the tx from the beginning of the file

128

17:31

<jnewbery> alright, next question: The first version of this PR included a behaviour change: when an incorrect block_index is provided to GetTransaction, but g_txindex->FindTx(hash, hashBlock, tx) finds and returns the tx. After this PR we would return the tx although it isn’t in the block the user asked for.

129

17:31

<jnewbery> This behaviour change was removed in a subsequent push to the PR. Do you think the behaviour change is an improvement? Should it be included in this PR?

130

17:33

<raj_> It doesn't seem to be an improvement to me. If the user provided wrong blockhash its better to notify that.

131

17:33

<murch> No, I think it shouldn't. When you explicitly filter something for a specific block it would be odd to get data back that doesn't fit the search parameters

132

17:33

<theStack> generally i'd argue it's more clean to separate performance optimizations and behavioural changes into different PRs, to not mix things up and lower review burden

133

17:33

<murch> If the user doesn't know which block to expect it in, they shouldn't/wouldn't restrict the search to that block.

134

17:34

<larryruane> jnewbery: I know you've got more questions, but if I may inject a question here: what is txindex used for; why do people sometimes enable it? One answer I can think of is: block explorers .. but maybe there are other reasons?

135

17:34

<murch> It could perhaps give a different error message to be more helpful to the user though.

136

17:34

<murch> "Tx found in a different castle"

137

17:35

<jnewbery> larryruane: great question! Anyone have any thoughts about that? Why do people enable txindex?

138

17:35

<larryruane> murch: ".. it would be odd to .." I agree completely

139

17:35

<willcl_ark> It's easier to look up transactions which aren't related to your wallet, on your local node?

140

17:35

<larryruane> obviously not for validity checking, or txindex wouldn't be optional!

141

17:35

<raj_> To quickly find lots of transaction via txid?

142

17:36

<jnewbery> theStack: Yes, regardless of whether it's an improvement or not, separating performance improvements from behavioural changes into different PRs is just good hygiene

143

17:36

<murch> jnewbery: To run additional services on top of bitcoin

144

17:36

<glozow> based on what the rpc docs suggest, `getrawtransaction(txid, blockhash)` means "look for this tx in this block" so if it's not in the block and a tx is returned, seems misleading

145

17:36

<pglazman> In the case of duplicate TxIDs that BIP30 addresses, would providing a block_hash help filter the expected transaction?

146

17:36

<larryruane> theStack: what's your opinion about separating those into different commits but still same PR? I've seen that done (i think)

147

17:36

<stickies-v> raj_: I don't even think it's about performance, without txindex I don't think you can search for transactions (not in your wallet) at all?

148

17:36

<jnewbery> murch raj_: I agree. Failing the request but returning a more specific error seems optimal

149

17:37

<jnewbery> pglazman: great question! Has anyone tried using getrawtransaction with either of the duplicate txids?

150

17:37

<raj_> stickies-v, ya makes sense, unless the user also happens to know the blockhash.

151

17:37

<jnewbery> either with or without the txindex enabled

152

17:37

<theStack> glozow: +1

153

17:38

<sipa> pglazman: i believe it will only find the latter transaction copy

154

17:38

<sipa> because the later one overwrites the former one

155

17:38

<sipa> (when using txindex)

156

17:38

<raj_> jnewbery, without txindex we would only get mempool txs right?

157

17:38

<larryruane> sipa: yes because leveldb isn't a multimap (can it even be?)

158

17:39

<sipa> no

159

17:39

<sipa> it's a key-value store, and the key is the txid

160

17:39

<jnewbery> does this PR prevent us from accessing the first one if txindex is enabled, even when providing the correct block hash?

161

17:39

<larryruane> sipa: got it, thanks

162

17:39

<lopp> jnewbery: it seems to me that anyone who has txindex enabled wouldn't want to bother searching by blockhash, though it could come in handy if you are looking for transactions in orphaned blocks.

163

17:39

<sipa> jnewbery: i suspect so

164

17:40

<sipa> i'm not sure we care for those two historical oddities...

165

17:40

<theStack> larryruane: i think behvaioural and optimizing changes should always at least separated by commits, but generally i'd prefer different PRs; in a single PR some people (e.g. people deep into optimization, but not caring so much about behaviour) can only "half-ACK" the PR :p

166

17:40

<jnewbery> I'm not sure what different information would be returned from the first and second ones. What block contextual information does getrawtransaction return?

167

17:41

<larryruane> theStack: +1

168

17:41

<murch> Just the blockheight maybe?

169

17:41

<jnewbery> sipa: agree that in practice, we probably don't care. It's a good test case though

170

17:41

<murch> and the hash of the block it was included in

171

17:41

<murch> The tx itself would obviously be immutable, since if it had been malleated, it would have a different txid

172

17:42

<glozow> jnewbery: also # confirmations and block time if verbose=true, it seems

173

17:42

<larryruane> for us newbies, are there 2 tx on the mainnet blockchain with the same txids? (but i assume different wtxids) And only 2?

174

17:42

<jnewbery> If anyone is scratching their heads about "multiple transactions with the same txid", it's all explained in BIP 30: https://github.com/bitcoin/bips/blob/master/bip-0030.mediawiki

175

17:43

<sipa> larryruane: yes, exactly two; no more, no less

176

17:43

<jnewbery> larryruane: 2 pairs, so 4 transactions

177

17:43

<sipa> oh right, 2 times 2

178

17:43

<jnewbery> or 2 transactions, depending on how you define transaction :)

179

17:44

<jnewbery> There's one more question: How can this PR be tested? Are any new test cases required?

180

17:44

<Sachin> larryruane https://github.com/bitcoin/bips/blob/master/bip-0030.mediawiki

181

17:44

<raj_> sipa, Where can i read more about these 2 and how/why the occurred?

182

17:44

<Sachin> raj_ see my link

183

17:44

<theStack> raj_: see jnewbery's link to BIP30 above

184

17:45

<murch> Yeahhttps://bitcoin.stackexchange.com/a/75301/5406

185

17:45

<raj_> Sachin, theStack Thanks..

186

17:46

<jnewbery> any thoughts about testing?

187

17:46

<theStack> as far as i remember the first version of the PR with behavioural changes still passed the CI; so there seem to be test coverage missing related to GetTransaction

188

17:46

<theStack> jonatack opened a PR with that

189

17:46

<raj_> Jonatack has wrote some functional functional tests, yet to go through them..

190

17:46

<lopp> this PR seemed to reveal a lack of testing around expected behavior of this function

191

17:46

<larryruane> jnewbery: There's another PR to improve the functional testing of this RPC, https://github.com/bitcoin/bitcoin/pull/22437

192

17:47

<theStack> lopp: +1

193

17:47

<murch> lopp: Did you ever find out what caused the functional test hiccough?

194

17:47

<jnewbery> lopp: I agree! It's always a shame when you change behaviour and nothing breaks

195

17:47

<larryruane> jnewbery: Seems like RPCs like this should be unit-testable, but `src/test/rpc_tests.cpp` seems to mostly test just argument processing

196

17:48

<sipa> damn tests, they're terrible. their sole purpose in life is to *fail* at the right time, and they can't even do that correctly

197

17:48

<lopp> murch: oddly enough, the first few commits kept failing and then suddenly started working despite my changes being cosmetic

198

17:48

<murch> Right, even though I could even reproduce the test failure locally

199

17:49

<jnewbery> our rpc_tests unit tests are very limited

200

17:49

* murch scratches head

201

17:49

<larryruane> jnewbery: but could they be much better?

202

17:49

<lopp> I assumed it was a CI issue because my tests passed locally

203

17:49

* unplanted makes a test of a test

204

17:49

<larryruane> (i'm not sure if they provide enough framework to be able to do more)

205

17:50

<jnewbery> larryruane: I generally think we use the functional tests too much when a unit test would be more appropriate. In this case though, I think the functional tests are fine. This functionality involves a lot of components (rpc, validation, txindex), and we just don't have the unit test framework to mock all those pieces and isolate the behaviour we want to test

206

17:50

<larryruane> I think some of the refactoring going on (such as eliminating globals) will make RPCs more unit-testable ... (?)

207

17:50

<murch> sipa: Yeah, we should really get rid of them on grounds of underperforming

208

17:51

<larryruane> :laugh:

209

17:51

<murch> Although I'd say we should look into hiring replacements

210

17:51

<jnewbery> larryruane: yes, there's ongoing work to elimate globals which would make all of our components more testable. It's really important work

211

17:51

<murch> lopp: Mh, they did fail for me locally as well.

212

17:52

<murch> I didn't try again after they passed in the build system, tho

213

17:52

<jnewbery> Was anyone surprised that GetTransaction() is in validation.cpp? It seems to me that node/transaction.cpp would be a more appropriate place for it.

214

17:53

<raj_> jnewbery, +1

215

17:53

<stickies-v> agreed!

216

17:54

<glozow> jnewbery ya

217

17:54

<jnewbery> seems weird that validation would call into txindex. I wonder if we remove this function, then validation would no longer need to #include txindex

218

17:54

<sipa> GetTransaction predates node/transaction.cpp, and even the generic index framework itself :)

219

17:55

<sipa> (before 0.8, validation itself used the txindex)

220

17:55

<jnewbery> (and GetTransaction() seems like a natural sibling to BroadcastTransaction(), which is already in node/transaction.cpp)

221

17:55

<jnewbery> sipa: right, this is not meant as a criticism of course. Just wondering if we can organize things a bit more rationally now that we have better separation between things.

222

17:55

<sipa> jnewbery: sure, just providing background

223

17:56

<sipa> seems very reasonable to move it elsewhere now

224

17:56

<jnewbery> ok, any other questions/thoughts before we wrap this up?

225

17:56

<larryruane> one small thing,

226

17:57

<larryruane> does rolling back the blockchain (reorg) need to update txindex?

227

17:57

<jnewbery> larryruane: great question!

228

17:57

<larryruane> i guess it must need to.. maybe using the rev files?

229

17:57

<murch> larryruane: I would certainly expect that it does!

230

17:57

<sipa> it doesn't need to

231

17:57

<sipa> it can report transactions in non-active chains, as long as the transaction wasn't re-confirmed in the main chain

232

17:58

<larryruane> oh that's cool!

233

17:58

<murch> sipa: Right, so wouldn't it weird to get a tx with a blockhash that might not be confirmed in the best chain?

234

17:58

<lopp> I'd expect txindex would remain the same until said transaction got confirmed in a new block, at which point the txindex pointer would get updated

235

17:58

<sipa> murch: eh, maybe :)

236

17:58

<glozow> murch: getrawtransaction tells you if it's in the active chain or not

237

17:58

<murch> mh

238

17:58

<murch> TIL

239

17:59

<sipa> i assume it's pretty much unused functionality, but it has always worked that way

240

17:59

<jnewbery> oh, I have one more question. Now that we're all experts in GDB thanks to larry's tutorial last week, did anyone step through the code using a debugger?

241

17:59

<larryruane> i did! haha

242

17:59

<jnewbery> larryruane: 🥇

243

18:00

<raj_> I plan to with jonatack's test tomorrow. :)

244

18:00

<jnewbery> Oh, there are also some gdb notes from last week that larry sent me, which I'll add to last week's meeting page