User talk:SMcCandlish
Welcome to SMcCandlish's talk page. I will generally respond here to comments that are posted here, rather than replying via your talk page (or the article's talk page, if you are writing to me here about an article), so you may want to watch this page until you are responded to, or let me know where specifically you'd prefer the reply. |
This user talk page might be watched by friendly talk page stalkers, which means that someone other than me might reply to your query. Their input is welcome and their help with messages that I cannot reply to quickly is appreciated. |
No RfAs or RfBs reported by Cyberbot I since 8:58 9/27/2024 (UTC)
13 template-protected edit requests | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Updated as needed. Last updated: 16:01, 30 October 2024 (UTC) |
- recent changes
- purge this page
- view or discuss this template
Request name | Motions | Initiated | Votes |
---|---|---|---|
Wikipediocracy-related conduct | 21 October 2024 | 4/1/2 | |
Marine 69-71 | 26 October 2024 | 0/0/0 |
No cases have recently been closed (view all closed cases).
Request name | Motions | Case | Posted |
---|---|---|---|
Amendment request: Palestine-Israel articles (AE referral) | Motion | (orig. case) | 17 August 2024 |
No arbitrator motions are currently open.
News and updates for administrators from the past month (October 2024).
- Eligible editors are invited to self-nominate themselves from 3 November 2024 until 12 November 2024 to stand in the 2024 Arbitration Committee elections.
- RoySmith, Barkeep49 and Cyberpower678 have been appointed to the Electoral Commission for the 2024 Arbitration Committee Elections. ThadeusOfNazereth and Dr vulpes are reserve commissioners.
- The Arbitration Committee is seeking volunteers for roles such as clerks, access to the COI queue, checkuser, and oversight. Applications are welcome at any time throughout the year.
Most recent poster here: Gerda Arendt (talk)
Mini-toolbox:
- My Wikimedia Library (journal access, etc.; to get your own, see WP:LIBRARY)
- Wikipedia:AutoWikiBrowser/Script (req. WP:AWB access and JWB installed or is just a normal redlink)
- Special:LintErrors
- Hunt down abuse of
{{em}}
for non-emphasis italics [1] — and<em>
[2] - Move and redirect articles with slashes in their titles when feasible (i.e. when not proper names that require them)
- NAC-at-ANRFC geekery to remember
- NAC-at-RM geekery to remember
- Ref consistency checker (use in preview or sandbox):
{{ref info|Manx cat|style=float:right}}
- Reliably regex match a single linebreak in wikicode (or elsewhere):
(\r\n|\r|\n)
- Helpful links related to WP:MEATBOT, WP:COSMETICBOT, and code cleanup: WP:EDITORFRIENDLY (a.k.a. WP:EDITORHOSTILE), WP:COSMETIC (a.k.a. WP:SUBSTANTIVE), WP:SPECTRUM
- All WP:CUE project participants should watchlist this alerts page.
Redirects for discussion
- 01 Sep 2024 – Eight-Ball (talk · edit · hist) →Eight-ball was RfDed by Thryduulf (t · c); see discussion
Featured list candidates
- 10 Oct 2024 – 1990–91 snooker world rankings (talk · edit · hist) was FL nominated by BennyOnTheLoose (t · c); see discussion
Good article nominees
- 10 Oct 2024 – Kingsley Kennerley (talk · edit · hist) was GA nominated by BennyOnTheLoose (t · c); start discussion
- 05 Oct 2024 – Tessa Davidson (talk · edit · hist) was GA nominated by BennyOnTheLoose (t · c); start discussion
- 25 Sep 2024 – Mink Nutcharut (talk · edit · hist) was GA nominated by BennyOnTheLoose (t · c); start discussion
- 30 Aug 2024 – Thelma Carpenter (billiards player) (talk · edit · hist) was GA nominated by BennyOnTheLoose (t · c); start discussion
- 08 Jun 2024 – 2024 World Snooker Championship (talk · edit · hist) was GA nominated by Lee Vilenski (t · c); start discussion
- 07 Jun 2024 – 2024 Tour Championship (talk · edit · hist) was GA nominated by Lee Vilenski (t · c); start discussion
Good article reassessments
- 26 Oct 2024 – Bottle pool (talk · edit · hist) was nominated for GA reassessment by Z1720 (t · c); see discussion
Requested moves
- 16 Oct 2024 – Rudolf Wanderone (talk · edit · hist) is requested to be moved to Minnesota Fats by TenPoundHammer (t · c); see discussion
Other:
- MW Editing team e-meetings, /wikimedia.org/edit-tasktriage via Google Hangouts (Tuesdays, noon–12:30pm PDT = 20:00 UTC during DST, 19:00 otherwise, but often half an hour earlier).
- MW Tech Advice e-meetings, via IRC at #wikimedia-tech connect (Wednesdays, 1–2pm PDT = 16:00–17:00 UTC).
- meta:Talk:Spam blacklist – global blacklist requests
As of 2024-10-31 , SMcCandlish is Active.
|
|
|
Old stuff to resolve eventually
[edit]Cueless billiards
[edit]Extended content |
---|
Categories are not my thing but do you think there are enough articles now or will be ever to make this necessary? Other than Finger billiards and possibly Carrom, what else is there?--Fuhghettaboutit (talk) 11:12, 18 January 2010 (UTC)
Sad...[edit]How well forgotten some very well known people are. The more I read about Yank Adams, the more I realize he was world famous. Yet, he's almost completely unknown today and barely mentioned even in modern billiard texts.--Fuhghettaboutit (talk) 13:47, 21 January 2010 (UTC)
|
Some more notes on Crystalate
[edit]Extended content |
---|
Some more notes: they bought Royal Worcester in 1983 and sold it the next year, keeping some of the electronics part.[3]; info about making records:[4]; the chair in 1989 was Lord Jenkin of Roding:[5]; "In 1880, crystalate balls made of nitrocellulose, camphor, and alcohol began to appear. In 1926, they were made obligatory by the Billiards Association and Control Council, the London-based governing body." Amazing Facts: The Indispensable Collection of True Life Facts and Feats. Richard B. Manchester - 1991wGtDHsgbtltnpBg&ct=result&id=v0m-h4YgKVYC&dq=%2BCrystalate; a website about crystalate and other materials used for billiard balls:No5 Balls.html. Fences&Windows 23:37, 12 July 2011 (UTC)
|
Extended content |
---|
No one has actually objected to the idea that it's really pointless for WP:SAL to contain any style information at all, other than in summary form and citing MOS:LIST, which is where all of WP:SAL's style advice should go, and SAL page should move back to WP:Stand-alone lists with a content guideline tag. Everyone who's commented for 7 months or so has been in favor of it. I'd say we have consensus to start doing it. — SMcCandlish Talk⇒ ɖ∘¿¤þ Contrib. 13:13, 2 March 2012 (UTC)
|
You post at Wikipedia talk:FAQ/Copyright
[edit]Extended content |
---|
That page looks like a hinterland (you go back two users in the history and you're in August). Are you familiar with WP:MCQ? By the way, did you see my response on the balkline averages?--Fuhghettaboutit (talk) 15:54, 6 January 2013 (UTC)
|
Hee Haw
[edit]Extended content |
---|
Yeah, we did get along on Donkeys. And probably will get along on some other stuff again later. Best way to handle WP is to take it issue by issue and then let bygones be bygones. I'm finding some interesting debates over things like the line between a subspecies, a landrace and a breed. Just almost saw someone else's GA derailed over a "breed versus species" debate that was completely bogus, we just removed the word "adapt" and life would have been fine. I'd actually be interested in seeing actual scholarly articles that discuss these differences, particularly the landrace/breed issue in general, but in livestock in particular, and particularly as applied to truly feral/landrace populations (if, in livestock, there is such a thing, people inevitably will do a bit of culling, sorting and other interference these days). I'm willing to stick to my guns on the WPEQ naming issue, but AGF in all respects. Truce? Montanabw(talk) 22:40, 6 January 2013 (UTC)
|
Redundant sentence?
[edit]Extended content |
---|
The sentence at MOS:LIFE "General names for groups or types of organisms are not capitalized except where they contain a proper name (oak, Bryde's whales, rove beetle, Van cat)" is a bit odd, since the capitalization would (now) be exactly the same if they were the names of individual species. Can it simply be removed? There is an issue, covered at Wikipedia:PLANTS#The use of botanical names as common names for plants, which may or may not be worth putting in the main MOS, namely cases where the same word is used as the scientific genus name and as the English name, when it should be de-capitalized. I think this is rare for animals, but more common for plants and fungi (although I have seen "tyrannosauruses" and similar uses of dinosaur names). Peter coxhead (talk) 09:17, 3 May 2014 (UTC)
|
Note to self on WP:WikiProject English language
[edit]Extended content |
---|
Finish patching up WP:WikiProject English language with the stuff from User:SMcCandlish/WikiProject English Language, and otherwise get the ball rolling. — SMcCandlish ☺ ☏ ¢ ≽ʌⱷ҅ᴥⱷʌ≼ 20:22, 17 August 2016 (UTC) |
Excellent mini-tutorial
[edit]Extended content |
---|
Somehow, I forget quite how, I came across this - that is an excellent summary of the distinctions. I often get confused over those, and your examples were very clear. Is something like that in the general MoS/citation documentation? Oh, and while I am here, what is the best way to format a citation to a page of a document where the pages are not numbered? All the guidance I have found says not to invent your own numbering by counting the pages (which makes sense), but I am wondering if I can use the 'numbering' used by the digitised form of the book. I'll point you to an example of what I mean: the 'book' in question is catalogued here (note that is volume 2) and the digitised version is accessed through a viewer, with an example of a 'page' being here, which the viewer calls page 116, but there are no numbers on the actual book pages (to confuse things further, if you switch between single-page and double-page view, funny things happen to the URLs, and if you create and click on a single-page URL the viewer seems to relocate you one page back for some reason). Carcharoth (talk) 19:10, 12 September 2016 (UTC)
|
Extended content |
---|
You had previously asked that protection be lowered on WP:MEDMOS which was not done at that time. I have just unprotected the page and so if you have routine update edits to make you should now be able to do so. Best, Barkeep49 (talk) 06:42, 25 January 2020 (UTC)
|
Ooh...potential WikiGnoming activity...
[edit]Extended content |
---|
I stumbled upon Category:Editnotices whose targets are redirects and there are ~100 pages whose pages have been moved, but the editnotices are still targeted to the redirect page. Seems like a great, and sort of fun, WikiGnoming activity for a template editor such as yourself. I'd do it, but I'm not a template editor. Not sure if that's really your thing, though. ;-) Cheers,
|
Note to self
[edit]Extended content |
---|
Don't forget to deal with: Template talk:Cquote#Template-protected edit request on 19 April 2020. — SMcCandlish ☏ ¢ 😼 14:48, 20 April 2020 (UTC) |
Now this
[edit]Extended content |
---|
Not sure the ping went through, so noting here. Just spotted where a now-blocked user moved a bunch of animal breed articles back to parenthetical disambiguation from natural disambiguation. As they did it in October and I'm only catching it now, I only moved back two just in case there was some kind of consensus change. The equine ones are definitely against project consensus, the rest are not my wheelhouse but I'm glad to comment. Talk:Campine_chicken#Here_we_go_again. Montanabw(talk) 20:14, 25 June 2020 (UTC)
|
PGP
[edit]FYI, it looks like your key has expired. 1234qwer1234qwer4 21:57, 5 November 2022 (UTC)
- Aiee! Thanks, I'll have to generate a new one when I have time to mess around with it. — SMcCandlish ☏ ¢ 😼 22:32, 5 November 2022 (UTC)
German article on houndstooth, Border tartan, and related patters
[edit]de:Rapport (Textil) is an intersting approach, and we don't seem to have a corresponding sort of article. Something I might approach at some point. — SMcCandlish ☏ ¢ 😼 22:11, 23 March 2024 (UTC)
Current threads
[edit]Capitalization after a hyphen
[edit]Hey there. In 2020, you moved Three-Fifths Compromise to Three-fifths Compromise, with the edit summary WP:HYPHEN (don't capitalize after a hyphen unless what follows the hypen it itself a proper name).
I have a question about that: are you certain WP:HYPHEN is saying "if what follows a hyphen is a proper noun" rather than "if the hyphenated compound is a proper noun"? If your interpretation of the wording is accurate, then I would propose that the exemption for "titles of published works" be extended to all proper nouns. In the case of the Three-Fifths Compromise, plenty of sources capitalize "fifths", including AP, NYT, WaPo, Forbes, LA Times, and Guardian, etc. This is also an outlier, as we have articles like Coca-Cola ("cola" is not a proper noun), Spider-Man ("man" is not a proper noun), Quasi-War ("war" is not a proper noun), Employment Non-Discrimination Act ("discrimination" is not a proper noun), etc. InfiniteNexus (talk) 19:37, 1 December 2023 (UTC)
Collapse-boxing a long thread so I don't have to keep scrolling past it |
---|
|
The following rules apply to hyphenated terms appearing in a title capitalized in headline style [...]
- Always capitalize the first element.
- Capitalize any subsequent elements unless they are articles, prepositions, coordinating conjunctions ([...]), or such modifiers [...] following musical key symbols.
- If the first element is merely a prefix or combining form that could not stand by itself as a word (anti, pre, etc.), do not capitalize the second element unless it is a proper noun or proper adjective.
- Capitalize the second element in a hyphenated spelled-out number (twenty-one or twenty-first, etc.) or hyphenated simple fraction (two-thirds in two-thirds majority).
The examples that follow demonstrate the numbered rules [...]
- [...]
- Record-Breaking Borrowings from Medium-Sized Libraries (2)
- [...]
- Anti-intellectual Pursuits (3)
- A Two-Thirds Majority of Non-English-Speaking Representatives (3, 4)
- [...]
APA:
In title case, capitalize the following words in a title or heading:
- [...]
- major words, including the second part of hyphenated major words (e.g., "Self-Report," not "Self-report")
MLA:
When you copy an English-language title or subtitle [...] use title-style capitalization: capitalize the first word, the last word, and all principal words, including those that follow hyphens in compound terms.
[...]
Do not capitalize the word following a hyphenated prefix if the dictionary shows the prefix and word combined without a hyphen.
- Theodore Dwight Weld and the American Anti-slavery Society
AMA:
In titles, subtitles, and text headings, do not capitalize the second part of a hyphenated compound in the following instances:
If either part is a hyphenated prefix or suffix (see [...])
- Nonsteroidal Anti-inflammatory Drugs for Ankylosing Spondylitis
If both parts together constitute a single word (consult [...])
- Reliability of Health Information Obtained Through Online Searches for Self-injury [...]
- Short-term and Long-term Effects of Violent Media on Aggression in Children
- [...]
However, if a compound is temporary or if both parts carry equal weight, capitalize both words.
- [...]
- Low-Level Activity
- Drug-Resistant Bacteria
- [...]
In titles, subtitles, and text headings, capitalize the first letter of a word that follows a lowercase (but not a capital) Greek letter (see [...]), a numeral ([...]), a symbol, a stand-alone capital letter, or an italicized organic chemistry prefix, [...]
AP makes no mention of capitalization after a hyphen, but "The Star-Spangled Banner" is given as an example of a title (which we also capitalize, would you look at that).
InfiniteNexus (talk) 18:48, 4 December 2023 (UTC)
Post-holiday followup
[edit]@InfiniteNexus: More research of the above sort is needed. To just dive in and do one bit of it, I find that MHRA Style Guide [15] has a ridiculously inconsistent rule to capitalize after a hyphen, even when it's a prefix that cannot stand alone, except when that prefix is specifically Re-. There is no rationale given for this weirdness. I think it would be worthwhile to look in other major style guides and see whether anything like a largely consistent pattern actually emerges. Your four American ones (at least two of which, APA and AMA, have been moving over time to be increasingly consistent with Chicago on many points) don't cover enough ground for us to be certain of this. And AMA is trying to be meaningful but failing dismally. "Short-term" and "low-level" are both the same kind of term; same goes for "self-injury" and "drug-resistant". They can all be split up without hyphens, without losing meaning: "A low level of drug resitance was observed over a short term in a study of patients admitted for self injury". (I guess this is what happens when medical people with no linguistics background try to write material about English-language structure and usage.) The unitary hyphenated compounds below cannot be split up this way (though some are sometimes colloquially written as hyphenless closed compounds: "knowhow" and "runnerup", but not "fatherinlaw").
Iff it turns out that there is a demonstrable lean across all major style guides, then we could probably encapsulate it with something simplified and easy to remember and apply, which might (more resesearch is needed) be something like:
In title case, capitalize after a hyphen when the compound is temporary (usually a multi-word modifier that would be written without hyphens if not used adjectivally): Real-Estate Demography, Remote-Control Operation, Common-Sense Guidelines. Do not capitalize after a hyphen if the term is a compound with:
- a prefix (Pre-eclampsia, Anti-establishment), unless what follows the hyphen is a proper name Neo-Aristotelian;
- a suffix (Dada-esque);
- a compound with a synergistic meaning separate from that of its parts and which is almost always hyphenated (Father-in-law, Know-how, Runner-up).
A construction like this would avoid AMA's categorical confusion; avoid highly debatable ideas like "constitute a single word", "if both parts carry equal weight", "principal words", "major words"; avoid "the dictionary" nonsense (there is no such thing as "the" dictionary, but lots of dictionaries which often conflict with each other and have different levels of prescriptive versus descriptive approach); and avoid nitpicky geekery no one is apt to care about, like musical key symbols and italicized organic chemistry prefixes (we should not address minutiae like that unless long-term dispute arises about it, per WP:CREEP and WP:MOSBLOAT).
However, I find the "the second element in a hyphenated spelled-out number" very dubious, and same with "-century" constructions; I have seen many titles of things that use "Twenty-two", "Fifty-third", and "Fourth-century"; this is one of several cases that needs more investigation in more style guides. And in the end, we are not required to do what a loose preponderance of other style guides seem to lean toward, especially when they contradict each other as to details and rationales; they are just duly informative with regard to what we decide. But we do need to decide something, since the extant material at MOS:TITLES has a gap, and people are not agreeing on what fits inside it.
PS: Your "The Star-Spangled Banner" is given as an example of a title (which we also capitalize, would you look at that)
smirking isn't constructive. You know as well as I do that WP content is not a source, and that editors doing stylistically questionable things at a particluar article has nothing to do with whether a style rule we have should be changed. More to the point, the style guides you quoted are not in agreement on it, and AMA for one would have it as "The Star-spangled Banner" because "star-spangled" is not a temporary compound but a poetic 18th-century neologism that is a unitary term and appears to have nearly no existence without the hyphen. MLA would also lower-case "-spangled" because of its dictionary rule [16]. — SMcCandlish ☏ ¢ 😼 01:43, 29 December 2023 (UTC)
- If you are still interested in looking into this, feel free to so, but right now I do not have time to continue delving into this matter. The four (five, if counting AP) style guides I looked at are probably the most widely used in the U.S., so it seems safe to assume that this is the norm among most external style guides. I don't have access to style guides from other countries. InfiniteNexus (talk) 07:35, 29 December 2023 (UTC)
Getting rid of {rp}
[edit]Applause! Now tell me how to get round wp:CITEVAR objections like this one: Talk:Eric Gill/Archive 1#Proposal to change citations of McCarthy's books to use harvard referencing and Talk:Eric Gill/Archive 1#Page number citations are expected when the source is a substantial book. I had hoped to get the Eric Gill article up to GA standard but I am too much of a secret typographer to put my name to a GAN, given its current spider-crawled-in-the-ink appearance. Sigh. 𝕁𝕄𝔽 (talk) 20:03, 18 December 2023 (UTC)
- @JMF: I think in the Gill case (if I'm reading it right), the other party's objection was to inline parenthetical referencing with page numbers, which the community also deprecated already, i.e. doing things like "This is a claim (Smith 2023, p. 7).", instead of "This is a claim.
<ref>Smith 2023, p. 7.</ref>
", or the templated equivalents "This is a claim.<ref>{{harv|Smith|2023|p=7}}</ref>
", or "This is a claim.{{sfn|Smith|2023|p=7}}
". It's actually possible that 14GTR was literally opposed to ever including page numbers in any form, in which case his argument has no WP:P&G legs to stand on and should just be ignored.Sudden flash of possible insight: A strong case can be made that because the community did clearly deprecate inline parenthetical referencing in 2020 (WP:PAREN), and the rationale for doing so was its reading-flow disruptiveness, not the fact that round-bracket characters were involved, this actually translates automatically to a deprecation of{{Rp}}
as well. It is simply another format for doing inline parenthetical referencing (its own documentation states explicitly that it's an adaptation from "full Harvard referencing and AMA style", though ultimately this is me quoting myself), just with fewer details and using superscript and colon, instead of more details with round brackets and no superscript or colon. That is, the deprecation is of citations that are inline and parenthetical, not inline and using what Americans call parentheses (round brackets). So, replacing "This is a claim (Smith 2023, p. 7)." but retaining "This is a claim<ref>Smith 2023.</ref>{{Rp|7}}
" to produce "This is a claim.[1]:7" is simply defying that site-wide consensus by still putting part of the citation (page numbers or other in-source locations) inline parenthetically – especially given that the template can be used to produce things like "This is a claim.[1]:viii–xiv, 7–9, 12, and back cover". Indeed, Wikipedia:Citing sources#Generally considered helpful already includes "converting parenthetical referencing to an acceptable referencing style". So, you could actually try that argument right now in doing cleanup of{{Rp}}
.Because of the "let chaos reign" stupidity that is WP:CITEVAR, some people are probably apt to try to argue against this, but I think their case will be weak and easily deflated. That said, probably the only path to total cleanup is going to be really fully documenting how to convert{{Rp}}
into other formats, and why it is a good idea, and why{{Rp}}
is bad, and then have a follow-up RfC or TfD to formally deprecate{{Rp}}
and mandate its replacement (mostly by AWB and sometimes even by bots for simple cases), so that it is no longer considered a valid "citation style" for CITEVAR purposes, no question about it. And I think the work in doing that documentation is going to be in my lap, though I'm not over-eager to wade into it right this second. It gives me a headache just thinking about it. — SMcCandlish ☏ ¢ 😼 21:04, 18 December 2023 (UTC)- The stonewall response was much as I expected though I had hoped that time and the offer of a ladder to climb down might just do the trick. AFAICS, the only way forward is to formally propose that {{rp}} be deprecated in favour of harvard referencing. Trouble is, when I tried to use the {{harvid}} method way back, I found it hostile. I persisted and matters much improved when I found {tl|sfnp}}. But other editors may have had their fingers burned and will resist, based on their bad experience way back when. So preparing the ground with explanation and education may be needed? --𝕁𝕄𝔽 (talk) 20:00, 20 December 2023 (UTC)
- Took me "a minute" to figure it out, too. I've started the slog of fully documenting how to replace
{{rp}}
, at User:SMcCandlish/Replacement of Template:Rp. Still needs some more info in it, and proofreading for any markup errors that mess up any of the code examples. — SMcCandlish ☏ ¢ 😼 09:46, 21 December 2023 (UTC)
- Took me "a minute" to figure it out, too. I've started the slog of fully documenting how to replace
- The stonewall response was much as I expected though I had hoped that time and the offer of a ladder to climb down might just do the trick. AFAICS, the only way forward is to formally propose that {{rp}} be deprecated in favour of harvard referencing. Trouble is, when I tried to use the {{harvid}} method way back, I found it hostile. I persisted and matters much improved when I found {tl|sfnp}}. But other editors may have had their fingers burned and will resist, based on their bad experience way back when. So preparing the ground with explanation and education may be needed? --𝕁𝕄𝔽 (talk) 20:00, 20 December 2023 (UTC)
Incremental updates
[edit]Update: This is going very slowly, but I'm committed to working on it. It's going to require a bunch of very well-tested regular expressions, used in series in a JS user script, to catch and clean up a large number of content use cases, so that it produces uniform citation formatting (and without breaking anything). My earlier-documented work toward that at the page mentioned above has already been surpassed, in code I'm developing off-site. I'll start building the regexes I'm working on into a JavaScript pretty soon and start testing that against real content and refining it. After it reliably works for all valid and most sane but invalid test cases, then we'll be able to do search–replace operations against {{rp}}
that will have predictable results with minimal errors. This is going to be a big project. It was more difficult than I expected because XML syntax (much less XML mixed with a {{...}}
syntax!) is incredibly difficult to parse accurately with regex (or anything else for that matter) reliably. I've been using advanced tools like regex101.com with complex blobs of valid and invalid test-case input, and using ChatGPT to try to work out particularly thorny matching failures, and so on. As an example, just one of the regexes developed so far looks like <ref\s+name\s*=\s*(?:"\s*([^"](?:(?!\s*\/>|\s*"\s*>|\s+(?:group|follow|extends)).)*?)\s*"|'\s*([^'"](?:(?!\s*\/>|\s*'>|\s+(?:group|follow|extends)).)*?)\s*'|([^"](?:(?!\s*\/>|\s*>|\s+(?:group|follow|extends)).)*))\s*(?:(\/)|)>
, and even this cannot yet handle <ref name=foo group=bar>
to normalize the name=
part, only to avoid breaking a ref that has a group=
part (and it does not do anything to normalize the latter part yet, only the former). — SMcCandlish ☏ ¢ 😼 00:06, 28 December 2023 (UTC)
- It now parses even stuff like
<ref group="bar's > / bar" extends=baz name='foos > / foo' follow="quux quux" />
(and some of the code it's accounting for is only in the beta of mw:Help:Cite and not deployed on en.WP yet), though this one regex only fixes up thename=
parameter; other passes with similar regexes would handle other attributes likegroup=
to normalize their formatting. Then another pass to fix spacing that shouldn't exist between citations. And so on. And of course a pass to replace{{rp}}
with{{sfnp}}
or whatever. Like I say, a multi-step process that'll be done by using the regexes in JS. The regex in question is now the monstrous<ref\s+((?:group|follow|extends)\s*=(?:(?!name\s*=).)*)?name\s*=\s*(?:"\s*([^"](?:(?!\s*\/>|\s*"\s*>|\s+(?:group|follow|extends)).)*?)\s*"|'\s*([^'"](?:(?!\s*\/>|\s*'>|\s+(?:group|follow|extends)).)*?)\s*'|([^"](?:(?!\s*\/>|\s*>|\s+(?:group|follow|extends)).)*))(\s+(?:group|follow|extends)\s*=(?:(?!\s*\/>|\s*>).)*)*\s*(?:(\/)|)>
. I'm suprised I pulled this off. Its one failure is that it can't gracefully handle the XML-valid (but technically ref-invalid) formname='foo "bar" baz'
(single-quoted value with nested double quotes) or the completely invalidname="foo "bar" baz">
; that's something that'll need to be handled by an earlier cleanup pass that looks just for those specific problems. — SMcCandlish ☏ ¢ 😼 02:02, 28 December 2023 (UTC)- Regex upgraded again, to handle line-breaking between
<ref>
attributes, as well as > inside quoted attributes aftername=
.- The new regex (just for handling
name
with or without other attributes present) is:<ref\s+((?:group|follow|extends)\s*=(?:(?!name\s*=)[\s\S])*)?name\s*=\s*(?:"\s*([^"](?:(?!\s*\/>|\s*"\s*>|\s+(?:group|follow|extends)).)*?)\s*"|'\s*([^'"](?:(?!\s*\/>|\s*'>|\s+(?:group|follow|extends)).)*?)\s*'|([^"](?:(?!\s*\/>|\s*>|\s+(?:group|follow|extends)).)*))(\s+(?:group|follow|extends)\s*=(?:(?!\s*\/>|"\s*>|'\s*>)[\s\S])*)*\s*(?:(\/)|)>
- It is already sophisticated enough to handle input as awful as:
<ref group= "bar's > / bar" extends= baz name= ' foos > / foo ' follow= "quux > quux" />
- Even
<syntaxhighlight>
can't deal with the above, but what I'm writing can. This one just cleans upname
(toname="foos > / foo"
from the above mess, and gets rid of the line break before the closing/>
while we're at it); similar regexes in later passes will deal withgroup
, etc., then eventually{{rp}}
replacement. — SMcCandlish ☏ ¢ 😼 03:27, 29 December 2023 (UTC) - Reminder to self: At some point, the script will also have to account for
{{#tag:ref |Citation content here. |name=... |group=... |follow=... |extends=...}}
(with parameters in various order and with or without linebreaks and extraneous spacing). — SMcCandlish ☏ ¢ 😼 03:37, 29 December 2023 (UTC)
- The new regex (just for handling
- Regex upgraded again, to handle line-breaking between
- It now parses even stuff like
Intervene?
[edit]Have you seen Help talk:Citation Style 1/Archive 92#Automating conversion of REF-plus-Rp to Sfn((m)p)? Do you want to launch a teaser trailer? Your call. --𝕁𝕄𝔽 (talk) 18:22, 28 December 2023 (UTC)
- @JMF: Done. — SMcCandlish ☏ ¢ 😼 01:43, 29 December 2023 (UTC)
Vauthors
[edit]Possibly telling people how to write harv citations is out of scope but I thought I should flag this one for you to include or ignore, your call. I've only just found the {{ref={{sfnref|blah blah}} }} facility and it is a lot more convenient that adding first=/last= to each and every name, just so you can write {{sfnp|last1|last2|last3|last4|2024}}. Here is a test example:
- Wang T, Mo L, Mo C, Tan LH, Cant JS, Zhong L, Cupchik G (June 2015). "Is moral beauty different from facial beauty? Evidence from an fMRI study". Social Cognitive and Affective Neuroscience. 10 (6): 814–23. doi:10.1093/scan/nsu123. PMC 4448025. PMID 25298010.
Up to you. --𝕁𝕄𝔽 (talk) 16:56, 29 December 2023 (UTC)
|vauthors=
in the documentation and scripting eventually. But |vauthors=
should not be used except in an article entirely done in Vancouver-style references (or it's against WP:CITESTYLE's instructions to use a consistent referencing style). It's a poor idea to use that style in the first place because it outputs less-useful author metadata, and much more importantly is harder to parse for readers (it is less clear that something like "Tan LH" is an individual's name than "Tan, L. H." that matches the rest of our initials formatting and other name handling, most especially when "Tan LH" appears in an article otherwise using citations that output "Tan, L. H."), and it's more error-prone for editors because this weird name formatting must be done exactly perfectly in that parameter. Another serious fault with it is that we often actually know complete author names (and these can be quite helpful in distinguishing authors and even in finding the source in the first place if it's something without a free-to-read URL or DOI), but |vauthors=
forces us to drop most of the name information we already have; it's a disservice to readers and to editors doing verification work. Any time I run into a |vauthors=
in an article that is not consistently in Vanc style, I replace it with a set of |last1=
|first1=
... (unless I'm in a big hurry or something), often with more complete author names.Using |ref={{sfnref|...}}
a.k.a. |ref={{harvid|..}}
isn't dependent in any way on |vauthors=
.Also, the Lua behind the citation templates can already parse the names inside |vauthors=
(if they were done right) and use them with {{sfnp}}
, {{harvp}}
, etc., directly. If we remove the |ref={{sfnref|Wang ''et al''|2015}}
from your example: Here is a claim in the article.[1]
Wang T, Mo L, Mo C, Tan LH, Cant JS, Zhong L, Cupchik G (June 2015). "Is moral beauty different from facial beauty? Evidence from an fMRI study". Social Cognitive and Affective Neuroscience. 10 (6): 814–23. doi:10.1093/scan/nsu123. PMC 4448025. PMID 25298010.
Just using the automated {{sfnp|Wang|Mo|Mo|Tan|2015}}
is clearer and easier than using a |ref={{sfnref|Wang ''et al''|2015}}
along with {{sfnp|Wang ''et al''|2015}}
. And there doesn't seem to be a consensus that "et al." should be italicized as Latin, because it is so assimilated into English, like "i.e." and "e.g."; I don't think any of our citation templates italicize it. (But it should have a "." after it, italicized or not, even in British usage, since it's a truncation abbreviation, of et alia.) Even without the italics, just using the automated {{sfnp|Wang|Mo|Mo|Tan|2015}}
is still clearer and easier than using a |ref={{sfnref|Wang et al.|2015}}
along with {{sfnp|Wang et al.|2015}}
. — SMcCandlish ☏ ¢ 😼 23:23, 29 December 2023 (UTC)
- Ah, I didn't realise that {{sfnp}} was able to deconstruct a vauthors list. I could have saved myself a lot of hassle. Now I've given myself some more hassle to redo it properly. ;-^
- (I too prefer to change a vauthors list to
|first1= last1= first2= last2=
etc. Generally I avoid using it when creating a citation except when the authors are Chinese or Japanese but the article is in English: how do I know if it is last=Mao first=Tse Tung or vice versa? I confess to using it too when ten authors are listed, for example on IPCC papers.) --𝕁𝕄𝔽 (talk) 17:18, 30 December 2023 (UTC)- Well, like I said at the other page, no one's ever going to be "punished" for mixing citation styles. :-) Someone else just might rearrange it later. It can be a hassle. I got pretty irritated in fixing a vauthor instance stuck into an otherwise non-Vancouver article, as it had over 30 authors. I've seen someone reduce this to the first four last/first pairs then do
|display-authors=etal
, but I'm a little down on that because we had more author information and doing that deleted it. I think I'll whip up a script to convert from vauthors to last/first, at least for my own convenience, but probably after doing this big ref-cleaner and rp-replacer job first.As for Asian names, I would guess just go by what the publication says; if it's "Chaudhary, C.; Richardson, A. J.; ...", and had a "Hua, X." or rarely but sometimes in Sinological material "Hua X" with no comma, in the author list, that already indicates the family-name order. But if the paper's author list started with "Chetan Chaudhary, Abigail Richardson, ..." and included something like "Hua Xiang" then it could be ambiguous; did they keep the same order, or give the Chinese names in surname-first order? I'm not sure vauthors would help here, since you wouldn't be sure whether to use "Hua X" or "Xiang H". Some familiarity with East Asian naming patterns helps. A name like "Hua Xhiangshu" or "Hua Xhian-shu" or "Hua Xiang-Shu" (orthography varies) would be family-name-first. People with more experience at it than I have can figure out Japanese names just by familiarity with which are usually given and which family names. Korean I'm generally at a loss with, unless it follows the Chinese pattern ("Lee Joon-gi" or "Lee Joon-Gi" or "Lee Joongi" is surname-first). It helps a little that a few Korean family names are overwhelmingly common, like Park/Pak/Paik, Lee/Li, Jun/Joon/June, Song/Sung, and Kim. When I'm unsure, I usually just Google around for other works by the same person until I can figure it out. If I could not at all, I would probably do
|author4=Hua Xiang
using the name order I had found (at all or most commonly) and leave it for someone with language/culture-specific experience to figure it out later. Maybe put in an HTML comment to this effect. — SMcCandlish ☏ ¢ 😼 22:52, 30 December 2023 (UTC)PS: As I understand it, the vauthors to sfnp/harvp "translation" uses the author names up to the first four. I'm not sure what happens when someone has a main cite with
|vauthors=Chaudhary C, Richardson AJ, Hua X
|display-authors=etal
. I'm not sure if the latter is just a visual injection of "et al.", or whether it counts as a fourth author name and would require{{sfnp|Chaudhary|Richardson|Hua|et al.|2023}}
. I suspect not, but something to test in a sandbox. — SMcCandlish ☏ ¢ 😼 23:04, 30 December 2023 (UTC)
- Well, like I said at the other page, no one's ever going to be "punished" for mixing citation styles. :-) Someone else just might rearrange it later. It can be a hassle. I got pretty irritated in fixing a vauthor instance stuck into an otherwise non-Vancouver article, as it had over 30 authors. I've seen someone reduce this to the first four last/first pairs then do
Games
[edit]S, I know you're into games and their capitalizations, so take a look at List of abstract strategy games. I downcased a whole bunch of games listed there already, but there are a few I'm not sure what to do about, such as Connect Four, that might be trademarks, or might be generic. Do you have any insights or advice on those? Dicklyon (talk) 07:25, 20 December 2023 (UTC)
- Will have a look-see, but am in middle of some detailed thangs. — SMcCandlish ☏ ¢ 😼 09:29, 21 December 2023 (UTC)
- @Dicklyon: For that one, we have an article title of Connect Four, and a lead that begins "Connect Four (also known as Connect 4, Four Up, Plot Four, Find Four, Captain's Mistress, Four in a Row, Drop Four, and Gravitrips". "Connect Four" does appear to be sourced as a Milton-Bradley (now Hasbro, after merger) trademark, along with the later "Connect 4" spelling. And in English, it is probably the WP:COMMONNAME even if we'd prefer otherwise. It is possible some of the other names are trademarks (or constitute titles of works in the form of commercially published variants of this game, more specifically), but would need to be investigated one-by-one, with those that are not trademarks being lower-cased. And it might be more WP:NPOV to rewrite most of the article to use one of the lowercased non-TM names, and only use "Connect Four" or "Connect 4" when referring to specific MB–Hasbro products/publications.What is presently at "score four" seems like it should be "Score Four" (trademark of Funtastic in 1968, AKA "Connect Four Advanced" by Hasbro later); there doesn't appear to be a generic name for that variant. And I'm skeptical it is a valid stand-alone article instead of a section at Connect Four, anyway; looks like it would not pass a GNG test at AfD.I didn't look closely at other examples. Were there some other iffy ones? — SMcCandlish ☏ ¢ 😼 01:43, 29 December 2023 (UTC)
- Sure, lots of potentially iffy ones. Like what I did here. You concur? And what about things like Five Field Kono that are usually capped in sources, for no apparaent reason? Dicklyon (talk) 05:51, 29 December 2023 (UTC)
- The cleanup at peralikatuma looks spot-on to me. And five field kono (not even a redirect there? FFS ....) is a folk game, not a trademark/publication, so should be lower-case, and as: five-field kono (
five-field {{lang|ko-Latn|kono}}
) – per MOS:HYPHEN and MOS:FOREIGN. The parent article gonu has similar issues. This is the kind of stuff MOS:GAMECAPS is specifically aiming to address (along with overcapitalization of things like sports, folk dances, sport/dance moves and techniques, game pieces, musical instruments, etc.). For at least the immediate future, we have one weird exception, for go (game), which is presently being rendered "Go", but obviously really should be go ({{lang|zh-Latn|go}}
), but we would need another RfC to undo the previous one that arrived at "Go" through what seems to be a WP:SSF-based WP:LOCALCONSENSUS. But in no way is "Go" some kind of "capitalize all Asian folk games" excuse. So, kono/gonu. — SMcCandlish ☏ ¢ 😼 06:20, 29 December 2023 (UTC)- Thanks, I fixed some more of those. There's still a ton of over-capping in games generally though. Dicklyon (talk) 18:27, 29 December 2023 (UTC)
- Yes there is. Probably still in a lot of dance articles, too, though I cleaned up a lot of those. Sports mostly look pretty good, but I still run into obscure ones over-capitalizing stuff. — SMcCandlish ☏ ¢ 😼 23:24, 29 December 2023 (UTC)
- Thanks, I fixed some more of those. There's still a ton of over-capping in games generally though. Dicklyon (talk) 18:27, 29 December 2023 (UTC)
- The cleanup at peralikatuma looks spot-on to me. And five field kono (not even a redirect there? FFS ....) is a folk game, not a trademark/publication, so should be lower-case, and as: five-field kono (
- Sure, lots of potentially iffy ones. Like what I did here. You concur? And what about things like Five Field Kono that are usually capped in sources, for no apparaent reason? Dicklyon (talk) 05:51, 29 December 2023 (UTC)
Dahua Technology
[edit]Hi SMcCandlish, I noticed that you are part of the category of Wikipedians willing to provide third opinions [17]. I have been working on Dahua Technology and am hoping you may be interested in reviewing an ongoing discussion on the talk page regarding specific terminology used in the article. I'd be grateful for your feedback and assistance in implementing the edits as you see fit. Thank you, Caitlyn23 (talk) 19:23, 20 December 2023 (UTC)
- Will try to look into it tomorrow, but it's been a long day for me already. — SMcCandlish ☏ ¢ 😼 09:31, 21 December 2023 (UTC)
- More like the next day or day after; have a lot going on. — SMcCandlish ☏ ¢ 😼 10:00, 22 December 2023 (UTC)
- Hi SMcCandlish, just checking back to see whether you may have time to review the discussion on the Dahua Technology talk page. I'd be interested to hear your thoughts and would appreciate assistance with the edits. Thanks again, Caitlyn23 (talk) 18:22, 28 December 2023 (UTC)
- More like the next day or day after; have a lot going on. — SMcCandlish ☏ ¢ 😼 10:00, 22 December 2023 (UTC)