WIP: docs: fixed formatting of many manpages #86

Upon review I realize my man pages had a lot of errors and wrinkles. Your changes are in sum an improvement but I find some of them to be questionable.

docs/dj.1 Outdated

					
				@ -61,1 +52,4 @@

				Takes no arguments and pads with nuls.

				.RE

				.B -B

trinity commented

2024-03-27 01:29:38 +00:00

It would make more sense to order the options `-ibscaAoBSHqd` in explanation: input, input options, alignment, output, output options, diagnostic options. Capitals before lowercases is especially confusing in the context of dj(1).

emma commented

2024-04-18 14:34:44 +00:00

I am still under the persuasion that we should order them alphabetically

trinity commented

2024-04-20 03:08:49 +00:00

I am still not.

emma commented

2024-04-24 01:24:33 +00:00

@silt what are your thoughts

silt commented

2024-04-29 02:32:37 +00:00


i think that in general, it makes sense to sort/group options logically, rather than alphabetically. when i go to a manpage to look at an option, options that are in some way related are far more useful to have close by than options that just happen to be alphabetical neighbors. the only benefit i see to alphabetical ordering is being able to quickly find a specific option in a long list of them, but that's not a good reason. quickly finding some text is what `grep` and its siblings are for.

👍 1 ❤️ 1 🎉 1

emma commented

2024-04-29 02:43:55 +00:00

@trinity what are your thoughts on the latest change I made?

trinity commented

2024-04-29 10:35:02 +00:00

Love it.

docs/dj.1 Outdated

					
				@ -69,0 +61,4 @@

				.B -H

				.RS

				Prints diagnostics messages in an alternate manner as described in the

trinity commented

2024-03-27 01:30:27 +00:00

"in an alternate, human-readable format" would be better; the 'H' stands for Human.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -71,2 +69,4 @@

				.RS

				Skips a number of bytes through the output before starting to write from

				the input. If the input is a stream the bytes are read and discarded. If the

				output is a stream, nul characters are printed.

trinity commented

2024-03-27 01:32:03 +00:00

"If the output is a stream, nul bytes are printed." Input is irrelevant here (this may be my own error).

emma marked this conversation as resolved

docs/dj.1 Outdated

 @ -80,1 +76,3 @@
 The
 .RS
 Takes one argument of one byte in length and pads the input buffer with
 that byte in the event that a read doesn’t fill the input buffer, and the

trinity commented

2024-03-27 01:34:44 +00:00

"and the"?..

`-a` pads the input buffer with the given byte in the event of an incomplete read from the input file. `-A` instead pads with the nul byte.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -85,0 +92,4 @@

				.B -d

				.RS

				Prints all debug information, user-specified or otherwise, before program

trinity commented

2024-03-27 01:36:03 +00:00

Specifically prints information related to invocation.

emma commented

2024-03-27 05:55:32 +00:00

Please elaborate.

trinity commented

2024-03-27 14:20:06 +00:00

See [src/dj.c:370](https://git.tebibyte.media/bonsai/coreutils/src/branch/main/src/dj.c#L370) which is the only thing that happens when the `debug` level is greater than `2` (the default).

Here's the stderr of dj -d:

argv0=dj
in=<stdin>      ibs=1024        skip=0  align=ff       count=0
out=<stdout>    obs=1024        seek=0  debug= 3       noerror=0

align is shown to be ff here because the two's complement representation of its sentry value (-1) is 0b 1111 1111 1111 1111 and only the lower 8b are used for alignment, or allowed when taking a new value as an argument (thus the sentry value can never be chosen by the user), or shown in the debug output.


emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -85,0 +98,4 @@

				.B -i

				.RS

				Takes a path as an argument to open and use in place of standard input.

trinity commented

2024-03-27 01:37:03 +00:00


`-` can be used to mean standard input or standard output. This may be noted elsewhere but is relevant here as well.

emma marked this conversation as resolved

docs/dj.1

					
				@ -85,0 +103,4 @@

				.B -n

				.RS

				Causes dj to exit on two consecutive empty reads instead of one.

trinity commented

2024-03-27 01:38:09 +00:00

Causes dj to give failed reads or writes a second try.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -85,0 +111,4 @@

				Does the same as

				.B -i

				but in place of standard output. Dj does not truncate output

				files and instead writes over the bytes in the existing file.

trinity commented

2024-03-27 01:39:09 +00:00

I think this would be more appropriate in a BUGS or CAVEATS section, perhaps with a "See BUGS" in the "-o" option.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -85,0 +117,4 @@

				.B -s

				.RS

				Takes a numeric argument as the number of bytes to skip into the input

				before starting to read.

trinity commented

2024-03-27 01:39:58 +00:00

If standard input is used, the bytes are read and discarded.

emma marked this conversation as resolved

docs/dj.1

					
				@ -85,0 +125,4 @@

				Suppresses error messages which print when a read or write is partial or

				empty. When

				.B -q

				is specified twice suppresses diagnostic output entirely.

trinity commented

2024-03-27 01:41:09 +00:00


It should be mentioned that `-q` and `-d` respectively decrement and increment the debug level of the program.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -151,1 +180,3 @@

				use.

				The dd(1p) utility specified in POSIX was the basis of this program.

				It includes additional features: typical option formatting, allowing seeks to be

trinity commented

2024-03-27 01:43:48 +00:00

What is "it"?

emma marked this conversation as resolved

docs/dj.1 Outdated

 @ -152,0 +183,4 @@
 specified in bytes rather than in blocks, allowing arbitrary bytes as padding,
 and printing in a format that’s easy to parse for machines. It also neglects
 character conversion. This may have been the original intent of dd(1p) but it is
 irrelevant to its modern use as a disk utility.

trinity commented

2024-03-27 02:00:02 +00:00

"its modern use". Its modern use is more as a file utility in some contexts (doas dd of=/root/accessible/only, dd bs=bytes count=1) and a disk utility (doas dd of=/dev/disk, dd if=/dev/hd bs=512 count=1 of=disktable) in other contexts - distinguished by user stress. dd(1p) is no more a disk utility than any other UNIX utility and probably not even a great tool for the job (a 512B buffer sucks for disk image writing - it's way too small!).

It varies greatly per user per context so leaving it ambiguous would be best.

emma marked this conversation as resolved

docs/false.1

					
				@ -15,2 +15,2 @@

				False does nothing regardless of operands or standard input.

				False will always return an exit code of 1.

				Do nothing regardless of operands or standard input.

				An exit code of 1 will always be returned.

trinity commented

2024-03-27 04:05:34 +00:00

Better: "Do nothing, unsuccessfully."

emma commented

2024-03-27 06:19:31 +00:00


I’m worried that slogan would force us under the GNU Free Documentation license as it is from the GNU man page for their implementation of `false(1)`.

emma commented

2024-04-18 14:37:26 +00:00

@trinity did you have any thoughts on this?

trinity commented

2024-04-20 03:09:38 +00:00

I think you're right here.

trinity marked this conversation as resolved

docs/intcmp.1 Outdated

					
				@ -20,3 +20,3 @@

				.SH DESCRIPTION

				Intcmp compares integers.

				Compare integers.

trinity commented

2024-03-27 04:07:35 +00:00

This infinitive present tense for descriptions feels off and I think this is a good example of why. "Compare integers" - who, what, when, where, why, how? It's easy to reference but more difficult to puzzle out for the casual reader.

emma marked this conversation as resolved

docs/intcmp.1

					
				@ -33,0 +29,4 @@

				.B -g

				or

				.B -l

				, only adjacent integers in the argument sequence can be equal.

trinity commented

2024-03-27 04:08:21 +00:00

Every comparison only compares with the integers next to it.

emma commented

2024-03-27 06:10:52 +00:00

Please elaborate on what is wrong here.

trinity commented

2024-03-27 13:52:27 +00:00

See [src/intcmp.c:62](https://git.tebibyte.media/bonsai/coreutils/src/branch/main/src/intcmp.c#L62):

	do{	r = c;
		c = strtol(argv[i], &argv[i], 10);
		if(*argv[i] != '\0' || errno != 0){
			fprintf(stderr, "%s: argument #%d: Invalid integer\n",
				argv[0], (int)i);
			return EX_USAGE;
		}

		if(i == optind)
			continue;

		/* rule enforcement; if a mode isn't permitted and the numbers
		 * correspond to it, return 1 */
		if(		(!(mode & EQUAL) && r == c)
				|| (!(mode & GREATER) && r > c)
				|| (!(mode & LESS) && r < c))
			return 1;
	}while(++i < argc);

`c` is the current integer, `r` is the reference integer to which `c` is compared. Only adjacent integers are ever compared. Equality is always contingent on adjacency; perhaps argv [1] and [3] can be equal in `1 == 1 == 1` whereas `1 >= 2 >= 1` is an invalid equation, but that's just the function of the comparisons there.

emma commented

2024-03-29 22:25:24 +00:00

I’m not really sure what I need to change here, then.

trinity commented

2024-04-10 03:03:26 +00:00

"Permits adjacent integers to be equal to each other" is sufficient to describe the full functionality.

emma marked this conversation as resolved

docs/intcmp.1

					
				@ -48,3 +66,3 @@

				There are multiple ways to express compound comparisons; “less than or equal

				to” can be -le or -el, for example.

				.PP

trinity commented

2024-03-27 04:11:41 +00:00

Is this replacement portable?

emma commented

2024-03-27 05:47:58 +00:00


I could not find reference to `.PP` in `roff(7)`.

trinity commented

2024-03-27 13:37:45 +00:00

Huh. Now that you mention it, I can't either.

trinity marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -39,2 +28,2 @@

				standard output. Standard output itself can be specified by giving the

				path '-'. Standard error itself can be specified with the

				.RS

				Opens subsequent outputs for appending rather than updating.

trinity commented

2024-03-27 04:14:27 +00:00


`s/subsequent//`. I realize options are only supported prior to positional arguments.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -43,2 +33,2 @@

				.PP

				The

				.RS

				Set the output to the standard error.

trinity commented

2024-03-27 04:15:00 +00:00

Use standard error as an output.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -61,3 +68,3 @@

				.SH BUGS

				Mm does not truncate existing files, which may lead to unexpected results.

				Existing files are not truncated, which may lead to unexpected results.

trinity commented

2024-03-27 04:23:20 +00:00

This is inconsistent with the changes made to the dj(1) man page, possibly to my original man pages but I haven't checked.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -67,1 +74,3 @@

				Mm was modeled after the cat and tee utilities specified in POSIX.

				The cat(1p) and tee(1p) programs specified in POSIX provide equivalent

				functionality. The separation of the two sets of functionality into separate

				APIs seemed unncessary.

trinity commented

2024-03-27 04:38:57 +00:00

cat(1p) and tee(1p) don't provide equivalent functionality; cat(1p) doesn't specify a way to ignore SIGINT and tee(1p) doesn't specify a way to ensure output is unbuffered.

Perhaps `sh -ec 'trap SIGINT true; cat'` would ignore `SIGINT` with cat(1p) and sh(1p), and `sh -c 'dd bs=1 >>file'` would append, unbuffered, to `file`. But I'm not sure if sh(1p)'s `trap` works like this, and I don't know if it buffers file redirections, and these are still only achievable with the addition of sh(1p).

emma marked this conversation as resolved

docs/npc.1 Outdated

					
				@ -25,0 +21,4 @@

				The program reads from standard input and writes to standard output, replacing

				non-printing characters with printable equivalents. Control characters print as

				a carat (“^”) followed by the character “@” through “_” corresponding to the

trinity commented

2024-03-27 04:42:16 +00:00

These were single quoted to indicate, following C conventions, specifically that they are ASCII bytes and not strings.

emma marked this conversation as resolved

docs/str.1 Outdated

					
				@ -20,3 +20,2 @@

				Str tests each character in an arbitrary quantity of string arguments against

				the function of the same name within ctype(3).

				Test string arguments against each other.

trinity commented

2024-03-27 04:44:54 +00:00

Against... each other?

emma marked this conversation as resolved

docs/str.1 Outdated

					
				@ -38,1 +39,3 @@

				('').

				Originally, there was an isvalue type as an extension to ctype.h(3), but it

				was removed in favor of using strcmp(1) to compare strings against the empty

				string ('').

trinity commented

2024-03-27 04:45:49 +00:00


I think we can remove this as I am probably the only one that used `isvalue`.

emma marked this conversation as resolved

docs/strcmp.1 Outdated

					
				@ -39,2 +39,2 @@

				Unicode strings may need to be normalized if the intent is to check visual

				similarity and not byte similarity.

				The program will exit unsuccessfully if the given strings are not identical;

				therefore, unicode strings may need to be normalized if the intent is to check

trinity commented

2024-03-27 04:46:36 +00:00

Unicode is a proper noun.

emma marked this conversation as resolved

emma added 10 commits 2024-03-27 06:18:19 +00:00

a1902df503

strcmp.1: Unicode is a proper noun

f565f0530b

dj.1: dd(1p) is not a disk utility

bb43533a37

dj.1: -A and -a: fix descriptions

49e2022e52

dj.1: -d, -i, -o, fixed descriptions

a2188dc674

dj.1: fix -H description

6158a39a4a

dj.1: consistency with mm.1

3cb37d830a

mm.1: wording, consistency with dj.1

d3bfc7b1f5

npc.1: ASCII bytes

cdd8e79b01

str.1: strings are not tested against each other

603d8ee1d8

str.1: remove extraneous former implementation information

trinity requested changes 2024-03-27 16:58:17 +00:00

trinity left a comment

Just some nitpicks and one goof on my end.

docs/dj.1 Outdated

					
				@ -48,0 +51,4 @@

				.RS

				If the output is a stream, nul bytes are printed. In other words, it does what

				.B -a

				does but with null bytes instead.

trinity commented

2024-03-27 13:54:28 +00:00


Nul bytes; "nul" as in ASCII NUL, as in the zero byte (`'\0'`).

emma commented

2024-03-29 22:20:00 +00:00


But in writing it is a [null byte](https://en.wikipedia.org/wiki/Null_character), because it is a byte that is null. The NUL representation is used in the context of displaying control characters.

trinity commented

2024-04-10 02:51:07 +00:00

While that is true, nul with a single L is used to refer to an eight-bit (theoretically, maybe even a seven-bit) zero value given in text encoding. Null with two Ls is often used to refer to the null (zero) memory address, which is typically somewhere between 16 and 48 bits. While the word "null" itself just refers to a zero value, "nul" implies a length in bits that "null" leaves ambiguous. Because otherwise they function identically I prefer to refer to any instance of the byte or character `'\0'` as nul regardless of use.

There are also contexts where dd(1p) pads with `' '` (a literal `0x20` if I recall my ASCII). If someone is unreasonably knowledgeable regarding particular dd(1p) usages, calling it a "nul" byte because it is a single ASCII character may make it clear for them that our alignment is an analogue to dd(1p)'s `conv=sync` without, in the dj(1) man page, discussing in depth a tool that is not dj(1).

emma commented

2024-04-18 14:38:18 +00:00

> Null with two Ls is often used to refer to the null (zero) memory address, which is typically somewhere between 16 and 48 bits.

But is it not clear if it is a null *byte* (which is 8 bits in length)?

trinity commented

2024-04-20 03:15:41 +00:00

"Null byte" makes sense but "nul byte" makes sense faster and indicates specifically the ASCII zero value versus an arbitrarily-sized ("byte" is sadly not always specific enough) value.

emma commented

2024-04-24 01:24:59 +00:00

The problem is that null is a word and nul is a representation in practice.

trinity commented

2024-04-29 10:34:19 +00:00

I think I'm willing to cede this hill.

trinity marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -70,2 +71,3 @@

				option skips a number of bytes through the output before starting to write from

				.RS

				Skips a number of bytes through the output before starting to write from

				the input. If the input is a stream the bytes are read and discarded. If the

trinity commented

2024-03-27 14:10:16 +00:00


`-S` only configures the output.

emma commented

2024-03-29 22:20:39 +00:00

Please elaborate.

trinity commented

2024-04-10 02:52:20 +00:00


Whether or not the input is a stream is irrelevant to the function of the option `-S`.

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -85,0 +129,4 @@

				.SH STANDARD INPUT

				The standard input shall be used as an input if one or more of the input files

				is “-”.

trinity commented

2024-03-27 16:46:34 +00:00

Or by default.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -39,2 +28,2 @@

				standard output. Standard output itself can be specified by giving the

				path '-'. Standard error itself can be specified with the

				.RS

				Opens outputs for appending rather than updating.

trinity commented

2024-03-27 16:54:41 +00:00


I know I corrected this but upon further reflection I have to fix this: `-a` opens *subsequent* outputs for appending, because outputs aren't specified positionally but optionally and therefore invocations like `mm -o - -o start -ao append` do open standard output and `start` for writing to the start and open `append` for appending. I was mistaken.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -45,0 +37,4 @@

				.B -i

				.RS

				Opens a path as an input. Without any inputs specified mm will use the

				standard input.

trinity commented

2024-03-27 16:55:26 +00:00

"-" will use standard input or standard output.

`"-"` will use standard input or standard output.

emma marked this conversation as resolved

docs/mm.1 Outdated

					
				@ -66,2 +78,3 @@

				Mm was modeled after the cat and tee utilities specified in POSIX.

				The cat(1p) and tee(1p) programs specified in POSIX together provide nearly

				equivalent functionality. The separation of the two sets of functionality into

trinity commented

2024-03-27 16:56:36 +00:00

"similar functionality".

emma marked this conversation as resolved

emma added 4 commits 2024-03-29 22:25:45 +00:00

70b0c2f924

dj.1: fixed -d description

4e33f945ae

dj.1: null bytes

9ea57a27b7

dj.1: stdin by default

63c8ff8093

intcmp.1: compares integers to each other

emma added 3 commits 2024-03-29 22:30:50 +00:00

abc599148d

mm.1: subsequent outputs are opened for appending

ce5a4dc4bd

mm.1: removed STANDARD INPUT section

5db09a5ca1

mm.1: cat(1p) and tee(1p) provide similar functionality

emma added 1 commit 2024-03-29 22:48:10 +00:00

13ee16173e

fop.1: initial commit

emma requested review from trinity 2024-04-10 01:47:49 +00:00

trinity requested changes 2024-04-10 03:05:51 +00:00

trinity left a comment

Increasingly rarer nitpicks.

docs/dj.1 Outdated

					
				@ -96,0 +76,4 @@

				.B -a

				.RS

				Takes one argument of one byte in length and pads the input buffer with it in

trinity commented

2024-04-10 02:54:00 +00:00


Could you change `^.*length` to "Accepts a single literal byte"?

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -96,0 +88,4 @@

				.B -c

				.RS

				Specifies an amount of reads to make, and if 0 (the default) dj will

trinity commented

2024-04-10 02:54:59 +00:00


It would be better if this was two sentences - "`. If zero,`".

emma marked this conversation as resolved

docs/dj.1 Outdated

					
				@ -107,4 +147,4 @@

				.R {records read} {ASCII unit separator} {partial records read}

				.R {ASCII record separator} {records written} {ASCII unit separator}

				.R {partial records written} {ASCII group separator} {bytes read}

				.R {ASCII record separator} {bytes written} {ASCII file separator}

trinity commented

2024-04-10 02:58:55 +00:00

I don't know if this should be noted in the man page but this diagnostic output is intended to be machine readable to make scripting easier. I've found dd(1p) to be not only needlessly verbose but also a pain in the ass in this regard.

I really like dj(1)'s `-H`. It made debugging very easy. Though I would be happy to be corrected with an even better output format.

emma marked this conversation as resolved

docs/dj.1

					
				@ -116,3 +157,4 @@

				.RS

				.R {records read} '+' {partial records read} '>' {records written}

				.R '+' {partial records written} ';' {bytes read} '>' {bytes written}

				.R {ASCII line feed}

trinity commented

2024-04-10 03:00:23 +00:00

Though this output prioritizes human readability it was also meant to be machine readable in case that was necessary. I couldn't imagine why and I hope it never would be, but if it is, it's easy.

emma marked this conversation as resolved

docs/intcmp.1

					
				@ -20,3 +20,3 @@

				.SH DESCRIPTION

				Intcmp compares integers.

				Compare integers to each other.

trinity commented

2024-04-10 03:02:18 +00:00

With what else would the integers be compared?

emma commented

2024-04-18 14:46:09 +00:00

#86 (comment)

https://git.tebibyte.media/bonsai/coreutils/pulls/86#issuecomment-4120

trinity commented

2024-04-20 03:13:57 +00:00

> This infinitive present tense for descriptions feels off and I think this is a good example of why. "Compare integers" - who, what, when, where, why, how? It's easy to reference but more difficult to puzzle out for the casual reader.

Rare self own. I think my much younger, more foolhardy self of March 2024 is overly cocky here; all that is necessary is to know that *intcmp* (who) *compares integers* (what) (and the rest of the man page is why, how). My problem was specifically with the infinitive. Next time I will be less arrogant.

emma commented

2024-04-22 15:55:32 +00:00

To be fair to me, this is not an infinitive, it is the second-person conjugation of the verb. The infinitive would be “to compare integers”, but it reads “[you] compare integers”. This phrasing is in line with many other man page descriptions I have read and I find it to be the best solution to the problem of program names being hard to fit into grammar (to capitalize or not to capitalize).

trinity commented

2024-04-22 22:32:49 +00:00

> This phrasing is in line with many other man page descriptions I have read and I find it to be the best solution to the problem of program names being hard to fit into grammar (to capitalize or not to capitalize).

I don't know if this is exactly what you were trying to convey but I understand now - it's the same tense (and you are right, I was mistaken about it being infinitive :P) as the program names themselves. I like that.

trinity marked this conversation as resolved

emma added 5 commits 2024-04-18 14:48:12 +00:00

df16707b0e

intcmp.1: -e permits adjacent integers to be equal to each other

3cdade71e2

dj.1: -S whether or not the input is a stream is irrelevant

ed284b9949

dj.1: -a: More specific wording

b41af1b578

dj.1: -c: grammar

187d9486b7

dj.1: debug output clarification

emma changed title from ~~docs: fixed formatting of many manpages~~ to WIP: docs: fixed formatting of many manpages

2024-04-24 21:00:58 +00:00

emma added 1 commit 2024-04-29 02:43:42 +00:00

432b19818e

made dj options no longer alphabetized

trinity reviewed 2024-04-29 10:26:16 +00:00

docs/dj.1

					
				@ -65,1 +56,3 @@

				The

				.RS

				Takes a numeric argument as the size in bytes of the input buffer, with the

				default being 1024 bytes or one kibibyte (KiB).

trinity commented

2024-04-29 10:26:16 +00:00


Perhaps that this is a kibibyte shouldn't be noted here. It may give the false impression that one could specify a SI prefix, e.g. `dj -b 1KiB`.

trinity reviewed 2024-04-29 10:33:29 +00:00

docs/dj.1

					
				@ -94,0 +100,4 @@

				If the output is a stream, null bytes are printed. This option is equivalent to

				specifying

				.B -a

				with a null byte instead of a character.

trinity commented

2024-04-29 10:33:29 +00:00

"**-a** but with null bytes; pads the input buffer with null bytes in the event of an incomplete read."

It's impossible to specify a null byte instead of a character. The current wording may imply that doing so is possible.

trinity commented

2024-04-29 10:46:19 +00:00

(There is the workaround of having an empty argument; if I recall the sh(1p) builtin `read` supports this at least with its `-b` option - I remember this because [an article trended on Hacker News recently](https://news.ycombinator.com/item?id=40166099) where the crux of the issue was that the writer didn't understand nul termination. I might consider using that here but it would be as much of a special case code-wise as `-A` is.)

silt commented

2024-05-08 03:15:26 +00:00

Option descriptions should start on the same line as the options themselves. See `ls(1)` for an example of what I mean.
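
For illustration, one way to get a description onto the same line as its option is the `.TP` tagged-paragraph macro from man(7). This is only a sketch, assuming dj.1 keeps using the man macro package; the description text is copied from the hunks under review and its wording is still being discussed elsewhere in this thread:

```roff
.\" Sketch only: .TP replaces the current .B/.RS/.RE layout.
.\" A tag narrower than the indent lets the description start on the tag's line.
.TP
.B \-i
Takes a file path as an argument to open and use as an input.
.TP
.B \-o
Takes a file path as an argument to open and use as an output.
```

With `.TP`, a tag wider than the paragraph indent pushes the description to the next line, so short tags such as `-i` are what make the same-line layout work.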

silt commented

2024-05-08 03:19:20 +00:00

Again, comparing the manpages with ls(1), we use far too many newlines to separate items. There should only be one newline before a header, including the first header (NAME).

Correct:

LS(1)                            User Commands                           LS(1)

NAME
       ls - list directory contents

SYNOPSIS
       ls [OPTION]... [FILE]...

DESCRIPTION
       List information about the FILEs (the current directory by default).
       Sort entries alphabetically if none of -cftuvSUX nor --sort is
       specified.

Whatever we're doing:

dj(1)                       General Commands Manual                      dj(1)




NAME
       dj – disk jockey


SYNOPSIS
       dj (-AdHnq) (-a [byte]) (-c [count])

       (-i [ input file ]) (-b [ input block size ]) (-s [ input offset ])

       (-o [ output file ]) (-B [ output block size ]) (-S [ output offset ])
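
One plausible cause of the extra vertical space, assuming dj.1 is written with the man(7) macros, is blank lines or `.PP` requests placed next to `.SH`, which already emits its own spacing. A minimal sketch of a source layout without them follows; the `.TH` arguments are a guess based on the rendered header above, and the `.\"` comment lines stand in for blank lines in the source:

```roff
.\" Sketch only: no blank lines and no .PP around section headings.
.TH dj 1
.SH NAME
dj \(en disk jockey
.\"
.SH SYNOPSIS
dj (\-AdHnq) (\-a [byte]) (\-c [count])
```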


silt reviewed 2024-05-08 03:22:36 +00:00

docs/dj.1

					
				@ -56,2 +51,2 @@

				.PP

				The

				.RS

				Takes a file path as an argument to open and use as an input.

silt commented

2024-05-08 03:22:36 +00:00

This could probably be rephrased for clarity. It's not hard to read this as the argument being the thing that gets opened, rather than the file specified by the path within that argument. Obviously that reading makes no sense, but I still think it could be rephrased.

silt reviewed 2024-05-08 03:22:56 +00:00

docs/dj.1

					
				@ -69,0 +67,4 @@

				.B -o

				.RS

				Takes a file path as an argument to open and use as an output.

silt commented

2024-05-08 03:22:56 +00:00

See https://git.tebibyte.media/bonsai/coreutils/pulls/86/files#issuecomment-4463

silt reviewed 2024-05-08 03:27:35 +00:00

docs/dj.1

					
				@ -73,2 +81,2 @@

				.PP

				The

				.RS

				Skips a number of bytes through the output before starting to write from

silt commented

2024-05-08 03:27:35 +00:00

Should clarify the difference between skipping n bytes and seeking to the nth byte.

cc @trinity


trinity commented

2024-05-08 05:26:26 +00:00

The former. I'd have to think about how to word it.

silt reviewed 2024-05-08 03:28:46 +00:00

docs/dj.1

					
				@ -92,2 +93,2 @@

				.B -q

				is specified a second time. The

				.RS

				Specifies a number of reads to make. If set to zero (the default), reading will

silt commented

2024-05-08 03:28:46 +00:00

@emma and I spoke verbally about this; fae wants to rewrite this to be less clunky.

silt reviewed 2024-05-08 03:29:56 +00:00

docs/dj.1

					
				@ -81,1 +89,4 @@

				of an incomplete read from the input file.

				.RE

				.B -c

silt commented

2024-05-08 03:29:56 +00:00

Reminder to @emma to swap the positions of `-c` and `-A`.

silt reviewed 2024-05-08 03:32:30 +00:00

docs/dj.1

					
				@ -96,0 +129,4 @@

				.SH STANDARD INPUT

				The standard input shall be used as an input if no inputs are specified one or

silt commented

2024-05-08 03:32:30 +00:00

> The standard input shall be used as an input if no inputs are specified **or if** one or

silt reviewed 2024-05-08 03:33:17 +00:00

docs/dj.1

					
				@ -106,1 +137,3 @@

				.PP

				On a partial or empty read, a diagnostic message is printed (unless the

				.B -q

				option is specified) and the program exits (unless the

silt commented

2024-05-08 03:33:17 +00:00

error: unmatched parenthesis on line 139

trinity commented

2024-05-08 13:49:49 +00:00

Good catch.
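
For reference, a sketch of the hunk with the second parenthesis closed, assuming the sentence ends at "option is specified" as the neighboring hunk suggests:

```roff
.\" Sketch only: same text as the hunk, with the second parenthesis closed.
On a partial or empty read, a diagnostic message is printed (unless the
.B \-q
option is specified) and the program exits (unless the
.B \-n
option is specified).
```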

silt reviewed 2024-05-08 03:33:58 +00:00

docs/dj.1

					
				@ -107,0 +140,4 @@

				.B -n

				option is specified.

				By default statistics are printed for input and output to the standard error in

silt commented

2024-05-08 03:33:58 +00:00

> By default, statistics are printed for input and output to the standard error in

Note the added comma.

silt reviewed 2024-05-08 03:36:45 +00:00

docs/dj.1

					
				@ -128,0 +164,4 @@

				If the

				.B -d

				option is specified, debug output will be printed at the beginning of execution.

				This debug information contains information regarding how the program was

silt commented

2024-05-08 03:36:45 +00:00

Merge this line with the above line.

silt reviewed 2024-05-08 03:41:32 +00:00

docs/dj.1

					
				@ -128,2 +179,4 @@

				diagnostic message is printed and the program exits with the appropriate

				sysexits.h(3) status.

				.SH BUGS

silt commented

2024-05-08 03:41:32 +00:00

> If -n is specified along with a count, actual byte output may

Removed superfluous word.

silt reviewed 2024-05-08 03:42:10 +00:00

docs/dj.1

					
				@ -96,0 +118,4 @@

				.B -n

				.RS

				Retries failed reads once more before exiting.

silt commented

2024-05-08 03:42:10 +00:00

Default is 1, but a number can be specified. This should be made clear here.

trinity commented

2024-05-08 05:33:51 +00:00

A number can be specified for -n in dj(1)?

emma commented

2024-05-08 14:17:43 +00:00

<https://git.tebibyte.media/bonsai/coreutils/pulls/86/files#issuecomment-4481>

trinity commented

2024-05-11 01:02:11 +00:00

That meant using `-n` and `-c` in tandem.

silt reviewed 2024-05-08 03:43:10 +00:00

docs/dj.1

					
				@ -136,25 +189,29 @@ expected (the product of the count multiplied by the input block size). If the

				or

				.B -A

				options are used this could make data written nonsensical.

silt commented

2024-05-08 03:43:10 +00:00

> options are used, this could make data written nonsensical.

Added comma.

silt reviewed 2024-05-08 03:44:40 +00:00

docs/dj.1

					
				@ -138,3 +191,3 @@

				options are used this could make data written nonsensical.

				.PP

				Many lowercase options have capitalized variants and vice-versa which can be

silt commented

2024-05-08 03:44:40 +00:00

@emma says that this should be moved to `CAVEATS`.

silt reviewed 2024-05-08 03:45:51 +00:00

docs/dj.1

					
				@ -148,0 +203,4 @@

				This program was based on the dd(1p) utility as specified in POSIX. While

				character conversion may have been the original intent of dd(1p), it is

				irrelevant to its modern use. Because of this, it eschews character conversion

				and adds typical option formatting, allowing seeks to be specified in bytes

silt commented

2024-05-08 03:45:51 +00:00

Again, seeking vs skipping. We need to land on one of them and stick to it, and make the behavior clear.

silt reviewed 2024-05-08 03:46:12 +00:00

docs/dj.1

					
				@ -147,1 +203,3 @@

				features: typical option formatting, allowing seeks to be specified in bytes

				This program was based on the dd(1p) utility as specified in POSIX. While

				character conversion may have been the original intent of dd(1p), it is

				irrelevant to its modern use. Because of this, it eschews character conversion

silt commented

2024-05-08 03:46:12 +00:00

Clarify the "it" here.

silt reviewed 2024-05-08 03:48:00 +00:00

docs/dj.1

 @ -149,3 +208,1 @@
 format that's easy to parse for machines. It also neglects character
 conversion, which may be dd's original intent but is irrelevant to its modern
 use.
 format that’s easy to parse for machines.