libs)

Jan 18, 2023

https://github.com/bitcoin/bitcoin/pull/26697

Host: larryruane - PR author: LarryRuane

The PR branch HEAD was 40e1923e856661fdf68cf783ed9c6d1931dfbdf3 at the time of this review club meeting.

Notes

The logging facility of bitcoind writes debug and informational messages to debug.log at the top level of the data directory. A brief description can be found in the developer notes.
LogPrintf() writes to debug.log unconditionallly. These messages are considered important enough to always be written.
LogPrint() takes a category as its first argument, and only writes to debug.log if the specified category is enabled. The list of categories is defined by enum LogFlags.
The user can enable a logging category, for example NET (p2p messages) by specifing the configuration option debug=net. The strings are defined by [LogCategories](https://github.com/bitcoin/bitcoin/blob/0c2a1288a210916254c939d080ce12ffc5547841/src/logging.cpp#L150) but the mapping from the enum category symbols to strings is trivial.
The logging categories can also be enabled and disabled at runtime using the logging RPC.
Multiple logging categories can be enabled simultaneously by specifying multiple debug= config options.
PR #9424 changed the logging categories from a set of strings (like "net") to a enumeration type (NET).

Questions

Did you review the PR? Concept ACK, approach ACK, tested ACK, or NACK?
What does this PR do, and what is the problem it proposes to address?
What are the advantages of std::bitset compared with bit manipulations on integers?
An earlier attempt, PR #26619, was closed in favor of the current PR. What are some advantages and disadvantages of each approach?
There are two special logging categories, ALL and NONE. Why are these needed?
Logger::m_categories was wrapped by std::atomic. Why was that needed? Why is it no longer needed?
Following on to that question, some concern was expressed on the PR about performance, and the need for benchmarking. Why would this be important?
An unknown P2P message does not cause a LogPrintf() message. Why not? Wouldn’t it be helpful to see if a peer is sending us a message we don’t understand?
Does the same principle apply to RPC, REST, or ZMQ requests?
The Logger::WillLogCategory is annotated with const; what does this mean? How is it possible to modify the object by locking m_cs?

Meeting Log

17:00

<LarryRuane> #startmeeting

17:00

<effexzi> Hi every1

17:00

<stickies-v> hi

17:00

<emzy> hi

17:00

<kouloumos> hi

17:00

<brunoerg> hi

17:00

<Yaz> hi

17:00

<lightlike> hi

17:00

<codo> hi

17:00

<LarryRuane> Hi everyone, please feel free to say hi

17:01

<LarryRuane> If you just want to "lurk" that's fine too!

17:01

<rozehnal_paul> hi

17:01

<LarryRuane> Is anyone here new to review club? Please feel free to introduce yourself

17:01

<Yaz> Hi, my name is Yazid, an Industrial Engineer, looking to strengthen myself in bitcoins source code:)

17:02

<LarryRuane> Reminder, if anyone is interested in hosting review club, just let me or @stickies-v or @glozow know, that would be great!

17:02

<dzxzg> Hi, second time at review club, just here to watch

17:02

<LarryRuane> Yaz: hi! welcome, glad you're here!

17:02

<coreyphillips> hi. I'm new. Will mostly lurk for now.

17:02

<LarryRuane> dzxzg: welcome to you too, glad you're back! :)

17:02

<b_101> hello every1

17:03

<LarryRuane> hello @coreyphillips great to have you here, feel free to lurk! Or ask questions!

17:03

<LarryRuane> Today's review is: https://bitcoincore.reviews/26697

17:04

<LarryRuane> We'll go through the questions, but feel free to continue discussing anything we've already covered

17:04

<LarryRuane> So let's begin, Did you review the PR? Concept ACK, approach ACK, tested ACK, or NACK?

17:04

<stickies-v> Approach ACK

17:05

<codo> Tested ACK

17:05

<emzy> Concept and tested ACK (not good at C++) :)

17:05

<LarryRuane> Oh I should also ask, does anyone have any questions or comments about the Notes? Anything you'd like to add?

17:06

<codo> The last part of the last Q I did not understand, but maybe I will if we come to that.

17:06

<glozow> For people who reviewed, how did you go about doing so?

17:07

<emzy> I first git clone it and compile it. Read the conversation and test it. Then try to understand a bit the code.

17:08

<roze_paul> I just read over the file changes and comments, going back to maxwell's PR in late 2016...but i didn't test...AFAIK all the testing added amounts to one in-line code (asserts)

17:08

<emzy> try to figure out what to test. I'm more testing then doing code review.

17:09

<emzy> That is manual testing.

17:09

<LarryRuane> good! Did anyone try the `logging` rpc? It's cool how you can enable and disable logging categories on the fly

17:09

<codo> I wrote down what I did in a comment in the PR: https://github.com/bitcoin/bitcoin/pull/26697#issuecomment-1387281337

17:09

<kouloumos> I haven't finish looking at the implementation yet, but what I started with is gathering context about how the change came to be, other related logging changes, other usages of bit operation in the codebase and looked a bit into the performance

17:09

<emzy> Yes, did that.

17:10

<emzy> Tested the RPC logging enable and disable on the fly. Very usefull.

17:10

<LarryRuane> those all are great! It's something I struggle with myself, how to go about reviewing a PR (there are so many things one can do)

17:10

<LarryRuane> let's go to Q2, What does this PR do, and what is the problem it proposes to address?

17:11

<codo> There was a limit on the number of possible logging categories. This PR removes that limit.

17:11

<roze_paul> it extends the number of log-topics available to an arbitrary amount by using the std::bitset function

17:12

<LarryRuane> roze_paul: yes, although it's actually a type

17:12

<emzy> Before it there was a bit set or unset to set the categories. That was limitet by the size of the interger used.

17:13

<LarryRuane> did you notice that `std::bitset` has a fixed size (number of bits)? the fact that the number of bits is specified within angle brackets indicates that

17:14

<LarryRuane> https://github.com/bitcoin-core-review-club/bitcoin/commit/40e1923e856661fdf68cf783ed9c6d1931dfbdf3#diff-21abb6b14af1e9330a6f0c89a87231035a439248c556ef5e110eb0617b88a1f4R77 (`ALL` is a constant)

17:14

<LarryRuane> emzy: +1

17:14

<brunoerg> So, `ALL` defines the maximum size?

17:15

<LarryRuane> yes, which is ... a little confusing! but it probably makes sense to do that identifier rather than create another one to indicate size

17:16

<brunoerg> makes sense

17:16

<LarryRuane> Q3 What are the advantages of std::bitset compared with bit manipulations on integers?

17:17

<andrew_m_> it has more internal methods?

17:17

<roze_paul> we get to get rid of the manual bitshifting code...which i don't understand why this is such a huge advantage, but it was stated in the notes as an advantage

17:17

<b_101> will size (ALL) wll change everytime a new log category gets added?

17:17

<emzy> It will be grow with more options for categories.

17:17

<lightlike> code looks cleaner, no need to manage the bits of an integer with "1 << 4" and such.

17:17

<roze_paul> other than making the code more concise

17:17

<roze_paul> +1 b_101

17:17

<LarryRuane> b_101: yes, but automatically

17:18

<LarryRuane> lightlike: yes, there may be fewer code conflicts to resolve (like if two PRs allocate the same bit)... although those aren't too hard to resolve

17:18

<emzy> I think it make the code more high level, without the bit shifting.

17:19

<roze_paul> another adv: we now get to utilize std::bitset's built-in functions like set() and reset()

17:19

<LarryRuane> here's what @lightlike is referring to: https://github.com/bitcoin-core-review-club/bitcoin/commit/40e1923e856661fdf68cf783ed9c6d1931dfbdf3#diff-21abb6b14af1e9330a6f0c89a87231035a439248c556ef5e110eb0617b88a1f4L44

17:19

<LarryRuane> emzy: yes, I like that about it too

17:20

<roze_paul> i can't recall if the version we are replacing also used a form of set and reset() ??

17:20

<LarryRuane> roze_paul: yes, I think those are conceptually more clear than `0` and `~0` :)

17:20

<kouloumos> test() is also cool

17:20

<LarryRuane> roze_paul: i think it does bitwise and (`&`) and or (`|`)

17:21

<LarryRuane> also for me, the type `uint32_t` is generic, just seeing that type doesn't indicate what how it's being used

17:21

<brunoerg> seems elegant set() and reset() for enabling and disabling categories, it's a good reason..

17:22

<LarryRuane> although that (`uint32_t` being too generic) could be improved with `using LoggingFlags = uint32_t` (or `typedef uint32_t LoggingFlags`)

17:22

<LarryRuane> (i think `using` is preferred)

17:22

<emzy> set() and reset() sounds more like C++ than C to me. :)

17:23

<LarryRuane> on the other hand, `std::bitset` may reveal too much of the underlying representation... but I don't think so because could be _conceptual_ bits (flags)

17:23

<emzy> So it fits better in my mind.

17:23

<LarryRuane> emzy: yes me too

17:24

<codo> is it also more portable?

17:24

<LarryRuane> codo: good point, it's definitely more abstract, so yes, I'd say more portable (hadn't thought of that)

17:25

<roze_paul> by portable, we mean between architectures && machines?

17:26

<LarryRuane> (i'll go on but again, feel free to keep discussing previous questions)

17:26

<LarryRuane> we kind of covered this already, but Q4: An earlier attempt, PR #26619, was closed in favor of the current PR. What are some advantages and disadvantages of each approach?

17:26

<LarryRuane> roze_paul: right, although i would say `uint32_t` is definitely the same everywhere

17:27

<LarryRuane> https://github.com/bitcoin/bitcoin/pull/26619

17:27

<emzy> I think the only change was using a bigger integer. So only extend it to 64 options.

17:28

<roze_paul> re. q4: the previous approach (26619) required less work, in that there was no approach change, just a change from 32 to 64 bit integers...in that sense it probably required less testing and review, which is an advantage

17:28

<brunoerg> #26619 you're just increasing the limit but not making it flexible like the new approach?

17:28

<LarryRuane> emzy: yes, smaller diff, less review and risk

17:29

<LarryRuane> brunoerg: right, it's conceivable that we could need more than 64 logging categories in the future (new subsystems or make them more fine-grained)

17:29

<LarryRuane> make existing categories more fine-grain

17:30

<LarryRuane> roze_paul: +1

17:31

<LarryRuane> I think another advantage of the closed 26619 is that we can still use std::atomic (I couldn't figure out how to wrap `std::bitset` with `std::atomic` but maybe there's a way)

17:32

<kouloumos> although "levels" seems to be targeting that fine-graining, right?

17:33

<LarryRuane> kouloumos: yes, I guess that can serve that purpose too, good point (i personally don't like levels, I've always like categories only)

17:33

<LarryRuane> let's try Q5 There are two special logging categories, ALL and NONE. Why are these needed?

17:34

<roze_paul> @larry do you use trace levels, to get all the data, and then filter the logging data yourself...just thinking of a way to work around using levels..

17:35

<roze_paul> Q5: not entirely sure, but i think calling all will turn on all logging topics, and none will turn of (bitset.reset) all topics...ALL also conveys the number of total number of topics, if one wants that info

17:36

<LarryRuane> Yes you could do that, I think the default is to get all the logging for a given category

17:37

<LarryRuane> well I think `ALL` being the total number of categories is just a code-internal thing, not anything the user is aware of

100

17:37

<LarryRuane> For Q5, i would say `debug=all` is more convenient than `-debug=net -debug=tor -debug=mempool ...`

101

17:38

<roze_paul> most definitely more convenient

102

17:38

<brunoerg> it's also easier to enable and disable all categories

103

17:38

<LarryRuane> the functional tests do this, because they assume that by default you would want to see all categories, AND, there's one other reason, anyone know why?

104

17:38

<LarryRuane> brunoerg: +1 that's a good point

105

17:38

<kouloumos> I think `NONE` indicates the unconditional logging done with `LogPrintf()`

106

17:39

<LarryRuane> kouloumos: i think that would be `ALL`, rather than `NONE`

107

17:40

<LarryRuane> well I'm not sure, maybe that's wrong, need to think about it!

108

17:41

<LarryRuane> notice you can't write `LogPrint(ALL, ...)` because those calls are always specified to a particular category

109

17:42

<kouloumos> I think that's what this implies https://github.com/bitcoin/bitcoin/blob/0c2a1288a210916254c939d080ce12ffc5547841/src/logging.h#L236

110

17:43

<LarryRuane> anyway, I think the other reason the functional tests enable all categories is to test the `LogPrint` calls! It would be bad if you enabled a category, and the `LogPrint` dereferenced a null pointer or something and crashed the process

111

17:44

<LarryRuane> kouloumos: Oh i see, you're right! thanks

112

17:45

<LarryRuane> we kind of touched on this already, but Q6 `Logger::m_categories` was wrapped by `std::atomic`. Why was that needed? Why is it no longer needed?

113

17:45

<roze_paul> i believe we replaced std::atomic with a rw-lock?

114

17:46

<kouloumos> interesting! I've seen that for functional tests there are some logging categories that we ignore, could this become be an issue if such regression occurs for those categories?

115

17:46

<LarryRuane> roze_paul: yes but just a regular mutex lock, not a read-write lock

116

17:47

<roze_paul> @LarryRuane that's the StdLockGuard scoped_lock ?

117

17:47

<LarryRuane> kouloumos: good point!

118

17:47

<LarryRuane> roze_paul: correct

119

17:48

<kouloumos> I think it was needed because of concurrency due to different components wanting to access the logger. I was curious why it was now replaced with locks, but I think you already touched why.

120

17:49

<LarryRuane> kouloumos: yes, I couldn't figure out how to wrap a `std::bitset` variable within `std::atomic` (but probably worth trying harder!)

121

17:50

<LarryRuane> good transition to Q7 Following on to that question, some concern was expressed on the PR about performance, and the need for benchmarking. Why would this be important?

122

17:51

<LarryRuane> i see that @kouloumos just posted some really helpful benchmark results to the PR: https://github.com/bitcoin/bitcoin/pull/26697#pullrequestreview-1253717862

123

17:51

<kouloumos> Cause logging is a common operation, so a slowdown could have a significant impact

124

17:52

<LarryRuane> kouloumos: right, especially if the relevant logging category is _not_ enabled (if enabled, probably performance isn't of much concern)

125

17:52

<emzy> If I read that corectly than there is no slow down from the change. I'm right?

126

17:53

<LarryRuane> that's why I think LoggingNoCategory is the most important of those results

127

17:53

<LarryRuane> no, there's a significant slowdown in LoggingNoCategory

128

17:54

<LarryRuane> but it's hard to say how important that difference is (is it a drop in the ocean compared with a half-drop in the ocean? Or is it a drop in the glass of water?) ... depends on the surrounding code

129

17:54

<emzy> Oh I see.

130

17:55

<LarryRuane> Q8 An unknown P2P message does not cause a LogPrintf() message. Why not? Wouldn’t it be helpful to see if a peer is sending us a message we don’t understand?

131

17:55

<kouloumos> Also looking into those benchmarks, I was wondering how significant are the results of such a logging benchmark. They are benchmarking the performance of a single invocation, is this actually a good metric?

132

17:55

<codo> re Q8: The daemon should ignore any unknown messages so it can't be DOS'sed.

133

17:55

<brunoerg> codo: +1

134

17:55

<LarryRuane> yes I think benchmarks attempt to isolate just one particular area of the code, so I think that's good

135

17:56

<brunoerg> DoS is a good explanation

136

17:56

<b_101> can the change of locking method of m_categories cause an impact?

137

17:56

<kouloumos> +1 they were concerns about such a logging attack in the past https://github.com/bitcoin/bitcoin/issues/21559

138

17:56

<LarryRuane> codo: yes! that's exactly what i had in mind... what would be the nature of the DoS?

139

17:56

<roze_paul> codo +1

140

17:57

<LarryRuane> oh there's the answer to my question right in the title of 21559 -- disk filling

141

17:57

<brunoerg> cool

142

17:57

<LarryRuane> Q9 Does the same principle apply to RPC, REST, or ZMQ requests?

143

17:57

<roze_paul> q9 i would imagine yes. same attack vector

144

17:57

<emzy> For shure the contents of the unknown P2P message must not be logged.

145

17:58

<LarryRuane> emzy: only the contents? what about the fact that it happened?

146

17:58

<brunoerg> what is "unknown P2P message" for us?

147

17:58

<brunoerg> unknown message or any message from an unknown peer?

148

17:59

<LarryRuane> unknown message ... in a way, all peers are unknown :)

149

17:59

<LarryRuane> I'm thinking that for Q9, no, because those sources are trusted (can do lots of other harmful things anyway)

150

17:59

<emzy> LarryRuane: seems to be relevant that there are unknown messages. Could be also a local problem. So I would like to have that option.

151

18:00

<LarryRuane> yes I think logging unknown messages from RPC etc. should be logged .. is that what you're saying?

152

18:00

<LarryRuane> we're at time, didn't get to Q10 The Logger::WillLogCategory is annotated with const; what does this mean? How is it possible to modify the object by locking m_cs?

153

18:01

<emzy> But LogPrintf() is unconditionallly. So It sould be in LogPrint() net

154

18:01

<LarryRuane> but if anyone would like to discuss, I'll stick around!

155

18:01

<LarryRuane> #endmeeting

156

18:01

<glozow> thanks LarryRuane! great meeting

157

18:01

<effexzi> Thanks every1

158

18:01

<codo> thank you LarryRuane a lot

159

18:01

<emzy> Thank you LarryRuane and all others!

160

18:01

<d33r_gee> thanks LarryRuane and every1

161

18:02

<roze_paul> q10 i couldn't find where m_cs was first declared, so i don't understand what m_cs even `is`

162

18:02

<roze_paul> thanks larry!!!

163

18:02

<brunoerg> Thanks LarryRuane

164

18:02

<svav> Thanks LarryRuane and all!

165

18:02

<brunoerg> learned a lot

166

18:02

<LarryRuane> thanks to all who participated! was really fun for me to host!

167

18:02

<kouloumos> Thank you LarryRuane!

168

18:06

<codo> I'd like to discuss Q10 also if more are interested.

169

18:06

<b_101> thnks LarryRuane: for hosting and everyone!

170

18:06

<b_101> yes I would like to understand Q10

171

18:07

<codo> I think the first part might be: the annotation with const is good practice

172

18:07

<codo> The second part of the question I do not understand

173

18:10

<b_101> const means no data will be changed by the fuction, right?

174

18:11

<b_101> they can't change the data members to be more precise

175

18:12

<codo> yes that is how I understand it also

176

18:12

<codo> but the function does not change anything, so that is why I think it is for good practice

177

18:13

<codo> that it shouts out "I'm only reading, not writing"

178

18:15

<b_101> ok, not sure about the second part either

179

18:16

<LarryRuane> b_101: both of you are on exactly the right track ... `const` means that calling this method won't change the state of the object (it's a "read-only" call)

180

18:16

<LarryRuane> but if a member variable is labeled `mutable`, then the method _can_ change the variable

181

18:16

<LarryRuane> (just that variable)

182

18:17

<LarryRuane> mostly done for locking

183

18:17

<codo> ah

184

18:17

<LarryRuane> https://en.cppreference.com/w/cpp/language/cv refers to the M&M rule: mutable and mutex go together

185

18:18

<LarryRuane> there's one other common use of `mutable` besides locking ... do you know what it is?

186

18:19

<codo> I don't

187

18:19

<b_101> so in this case m_cs is the mutable data member?

188

18:19

<LarryRuane> right

189

18:21

<b_101> so this `StdLockGuard scoped_lock(m_cs);` mutates m_cs?

190

18:22

<LarryRuane> yes

191

18:22

<b_101> thx LarryRuane: very interesting

192

18:23

<LarryRuane> that acquires (locks) `m_cs` ... then it also get mutated (unlocked) when `scoped_lock` goes out of scope (by its destructor)

193

18:23

<b_101> right, thanks for clarifying that!

194

18:23

<LarryRuane> `mutable` is also often used for member variables that are merely a cache of some kind, for performance ... doesn't change functionality

195

18:24

<LarryRuane> here's an example (you can see from the name of the variable member): https://github.com/bitcoin/bitcoin/blob/aef8b4f43b0c4300aa6cf2c5cf5c19f55e73499c/src/coins.h#L220

196

18:25

<LarryRuane> the comment just above that line is very helpful

197

18:25

<b_101> yes

198

18:26

<LarryRuane> but `mutable` does allow you to cheat, in that a const method can make changes that are actual functional changes! i.e. change the _logical_ state of the object

199

18:26

<b_101> I have been studying mutex and locks, a little confused why Bitcoin Core has so many LOC macros instead of using standard c++ lock functions

200

18:27

<LarryRuane> i.e. nothing in the language verifies that your use of `mutable` and `const` functions are non-functional ... there's probably no way to automatically check that

201

18:27

<b_101> LarryRuane: ok, thx

202

18:29

<LarryRuane> b_101: yes, I think `LOCK()` is the most common way to do locking (I don't know why the logging subsystem doesn't use `LOCK`), but what that macro does is actually declare a (local) variable with some constructed artificial name,

203

18:30

<LarryRuane> and that variable's constructor actually does the mutex lock, and its destructor does the unlock.. which is very clever! it's hard to make the mistake of forgetting to unlock a mutex (like it is if you're doing unlocking explicitly)

204

18:31

<LarryRuane> you may notice lots of places where there's an opening brace (`{`) then a call to `LOCK`, then a small amount of code, then the close brace ... that's to limit the duration of the mutex locking to just the block of code (not locked after the close brace)

205

18:32

<b_101> yes, I still have to read more and make some basic c++ toy projects to fullly understand it

206

18:32

<LarryRuane> `StdLockGuard scoped_lock(m_cs)` has that same property as `LOCK` where the unlock is automatic, but with `StdLockGuard` you're making the variable visible (you choose the name), instead of it being hidden within the `LOCK` macro

207

18:33

<LarryRuane> and as i said, i don't really know why it's done that way in the logging subsystem ... maybe historical? maybe `LOCK` didn't exist when the logging code was written? not sure