Commons:Bots/Requests
If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.
Please read Commons:Bots before making a request for bot permission.
I. Create a user account (while logged in to your normal account) and user page for the bot
On the bot's userpage, add {{Bot}}, which automatically adds the page to Category:Commons bots. Then add the following information to the bot's userpage (all of this is mandatory):
II. Create your bot request
Add your bot request to the list here:
III. Test run
You may be asked to make a short test run with your bot account (30–50 edits/uploads) so that other users can review your bot's tasks. Unauthorized test runs are not allowed.
IV. Wait for approval
You now need to wait for community approval. A bureaucrat will close the request and will also grant a bot flag, where necessary. Closed requests are moved to Commons:Bots/Archive.
Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.
Requests for permission to run a bot
Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.
When complete, pages listed here should be archived to Commons:Bots/Archive.
Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.
ImagehashBot (talk · contribs)
Operator:
- Zache (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
- Ipr1 (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: adding pHash checksum (P9310) and Imagehash difference hash (P12563) values to the photos.
- Documentation for the hashes
- pHash: https://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html
- dHash: https://www.hackerfactor.com/blog/index.php?/archives/529-Kind-of-Like-That.html
- Example images with P9310 and P12563 values
- https://commons.wikimedia.org/wiki/File:Patricia-Seppala-1995.jpg
- https://commons.wikimedia.org/wiki/File:Jean-Sibelius-1927.jpg
- https://commons.wikimedia.org/wiki/File:Potato_crop_lifting_(JOKAMT2Ju29-2).tif
The first targets are photos from Europeana, Estonia, Finland, Sweden, and Flickr, but the long-term goal is to add image hashes to all Commons photos. So far we have used FinnaUploadBot for Finna images. The new account is meant to be a dedicated account and service for edits unrelated to Finna.
Automatic or manually assisted: automatic
Edit type (e.g. Continuous, daily, one time run): first batch jobs, later continuous
Maximum edit rate (e.g. edits per minute):
Bot flag requested: (Y/N): Y
Programming language(s):
Zache (talk) 15:08, 12 April 2024 (UTC)
- Discussion
- What is use for such hashes? --EugeneZelenko (talk) 14:47, 13 April 2024 (UTC)
- One can use them to compare the similarity of pictures by checking how much the identifiers differ to detect duplicates and match photos in different repositories. We have used image hashes to prevent duplicates when uploading files and to prevent the wrong photos from being updated when reuploading photos from Finna with better quality and/or updating metadata. --Zache (talk) 16:31, 13 April 2024 (UTC)
- Such hashes make much more sense as part of Commons database. --EugeneZelenko (talk) 14:26, 14 April 2024 (UTC)
- In SDC they are file metadata, and SPARQL in particular would make it easy to query the hashes and share them for external use. That is, they are part of the metadata for the files. Zache (talk) 14:52, 14 April 2024 (UTC)
- Also, even if the information were added to the Wikimedia Commons database (there are good technical reasons why one would prefer an external service over adding this to MediaWiki core), I would like to note that we are populating SDC values from the Commons internal database using bots. Most notable in this context are the SHA-1 checksum, MIME type, image width, and image height. (Commons:Structured data/Modeling/Meta) And yes, there would probably be better ways to do this, but currently using bots is the preferred method. --Zache (talk) 06:42, 18 April 2024 (UTC)
- Is there any community discussion that such data shall be generated at large scale? Krd 06:53, 18 April 2024 (UTC)
- I am not aware that there would have been a wider discussion. Current discussions, to my knowledge, are related to the Fæ's User:Fæ/Imagehash and village pump discussions 1 and 2. In my structured data property proposal in 2021, there were no follow-up comments in Wikimedia Commons. Phabricator has some tickets (for example, phab:T121797) related to image hashing.
- Also, just for background, I am running ImageHash-Toolforge, which has approximately 25% of Wikimedia Commons bitmap images (jpg, tiff, png) indexed with phash and dhash. I also made a Wikimania lightning talk proposal for it. (Proposals are currently under review.) My current idea was to proceed gradually when adding values to SDC, and my current personal need was to add hashes to European and Estonian photos before the Wikimedia Hackathon, Tallinn, in May so they would be available there. (see my question in Commons_talk:Bots/Requests#Extending_FinnaUploadBot).
- However, if you think I should do the village pump discussion or the discussion on the Structured Data talk pages, I am happy to start these. --Zache (talk) 07:49, 18 April 2024 (UTC)
- Please do. Krd 05:48, 21 April 2024 (UTC)
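The comparison described in the discussion above (checking how much two hash values differ) can be sketched as a bit-level Hamming distance. This is only an illustration, not the bot's actual code: it assumes the pHash and dHash values are available as equal-length hexadecimal strings, and the function names and the `max_bits` threshold are hypothetical.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count the differing bits between two equal-length hexadecimal hashes."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    # XOR leaves a 1 bit wherever the two hashes disagree.
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")


def looks_like_duplicate(hash_a: str, hash_b: str, max_bits: int = 4) -> bool:
    """Flag a likely duplicate when few bits differ (threshold is an assumption)."""
    return hamming_distance(hash_a, hash_b) <= max_bits
```

A distance of 0 means the two hashes are identical; a suitable threshold for "similar enough" would have to be tuned against real files.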
APPERbot (talk · contribs)
Operator: Wurgl (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: One-time job: remove 550+ language links to nonexistent pages in de-WP, see Commons:Village_pump/Technical#Interwikilinks_to_german_wikipedia_with_no_corresponding_german_article
Update the graphic File:Normdatenentwicklung-de-wikipedia.svg four times a year; it is already generated on the tools servers
Automatic or manually assisted: Automatic
Edit type (e.g. Continuous, daily, one time run):
Maximum edit rate (e.g. edits per minute): 10 (like in deWP)
Bot flag requested: (Y/N): Y
Programming language(s): PHP
Wurgl (talk) 21:22, 9 April 2024 (UTC)
- Discussion
- I have been running this bot on deWP since January 2017. The bot currently has 1,715,969 edits, but only 927,842 of them were made since I took control of it.
- For this (first) one-time job, see https://persondata.toolforge.org/data/common_diff.txt for a diff showing what the bot would do --Wurgl (talk) 21:22, 9 April 2024 (UTC)
- Please make test run. Please create bot's user page. --EugeneZelenko (talk) 14:55, 10 April 2024 (UTC)
DaxBot (talk · contribs)
Operator: DaxServer (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Task #3 - Upload images from the w:Capella Space Open Data collection of w:Synthetic-aperture radar captures. The dataset is released under CC-BY-4.0 https://www.capellaspace.com/gallery/. The radar data is duplicated across a few formats (example), but all of them share the same associated PNG thumbnail preview image. I've de-duplicated on the PNGs and have no influence on which of the duplicates is selected and linked from Commons. Uploads are tracked under Category:Files from Capella Space uploaded by DaxBot.
Automatic or manually assisted: Automatic (manually started)
Edit type (e.g. Continuous, daily, one time run): Quarterly
Maximum edit rate (e.g. edits per minute): 1-3
Bot flag requested: (Y/N): N
Programming language(s): https://n8n.io/-powered workflow with JavaScript snippets and MediaWiki API to upload
-- DaxServer (talk) 12:33, 5 April 2024 (UTC)
- Discussion
- Looks OK to me, but it would be a good idea to add country/region categories, since bounding boxes are known a priori. --EugeneZelenko (talk) 15:02, 5 April 2024 (UTC)
- Ya, sounds good. Let me do that test -- DaxServer (talk) 15:11, 5 April 2024 (UTC)
Svetlov Artem Bot (talk · contribs)
Operator: Svetlov Artem (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought:
- Change date=2004-07-17 12:00:00 to date={{Taken on|2004-07-17 12:00:00|location=Russia}} (or a Russian subregion) for photos in manually selected categories.
- Create categories like Category:Tula_Oblast_photographs_taken_on_2004-07-17 and Category:Russia_photographs_taken_on_2004-07-17 if they do not exist.
- Remove [[Category:Russia photographs taken on 2004-07-17]] from the file description if the edit succeeds.
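The wikitext changes listed above can be sketched with plain regular expressions. This is a minimal illustration and not the operator's pywikibot script: the function names are hypothetical, the date is assumed to be a bare `YYYY-MM-DD` value (optionally with a time) on its own `|date=` line, and the category is assumed to be written with spaces rather than underscores.

```python
import re

# Matches a bare ISO date (optionally with time) in a "|date=" parameter line.
DATE_FIELD = re.compile(
    r"(\|[ \t]*date[ \t]*=[ \t]*)(\d{4}-\d{2}-\d{2}(?: \d{2}:\d{2}:\d{2})?)[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def wrap_taken_on(wikitext: str, location: str) -> str:
    """Wrap a bare date value in {{Taken on|...|location=...}}."""
    def repl(match: re.Match) -> str:
        return f"{match.group(1)}{{{{Taken on|{match.group(2)}|location={location}}}}}"
    return DATE_FIELD.sub(repl, wikitext)


def remove_dated_category(wikitext: str, location: str, day: str) -> str:
    """Drop the explicit per-day category line for the given location and day."""
    name = re.escape(f"{location} photographs taken on {day}")
    return re.sub(r"\[\[Category:" + name + r"\]\]\n?", "", wikitext)
```

Because the pattern only matches a bare date, a value that is already wrapped in a template is left untouched, so running the sketch twice on the same page should not change it again.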
Automatic or manually assisted: Automatic and unsupervised, on categories manually selected by the bot operator
Edit type (e.g. Continuous, daily, one time run): multiple manual runs
Maximum edit rate (e.g. edits per minute): 10
Bot flag requested: (Y/N): Y
Programming language(s): pywikibot
Svetlov Artem Bot (talk) 18:26, 25 March 2024 (UTC)
- Discussion
- Please make test run. Also bot account should not be used for generic edits like this request. --EugeneZelenko (talk) 15:43, 26 March 2024 (UTC)
- Here is contributions: https://commons.wikimedia.org/wiki/Special:Contributions/Svetlov_Artem_Bot
- I used to run this bot from my main account last year, and user @User:Cryptic-waveform asked me to run it from bot account: https://commons.wikimedia.org/wiki/User_talk:Svetlov_Artem#c-Svetlov_Artem-20231107212100-Cryptic-waveform-20231107211900 Svetlov Artem Bot (talk) 16:18, 28 March 2024 (UTC)
- Please leave the bot account exclusively for your bot's edits.
- Looks good to me. Some files have duplicate {{Taken with}} tags like File:Zukovskiy industrial railway 2022-10 1664723936.JPG for example. Would it be possible to remove the dupes on the fly when dealing with the files? --Achim55 (talk) 18:49, 28 March 2024 (UTC)
- There were some manually inserted templates. Taken on and Taken with are completely different templates: one is for the date, the other for the camera. I cannot automatically edit the Taken with template at this point; it needs a complicated investigation of EXIF camera tags, which always differ from the real camera names. Svetlov Artem (talk) 08:57, 4 April 2024 (UTC)
What is the reason to do such edits, in which way does it improve the file page? Is there any community discussion about doing this at large scale? --Krd 06:58, 18 April 2024 (UTC)
- I was asked by user @MasterRus21thCentury, may you comment? https://commons.wikimedia.org/wiki/User_talk:Svetlov_Artem#c-MasterRus21thCentury-20231026174700-Category:Russia_photographs_taken_on_2008-05-01
- Such categories are useful for discovering photos in a series: if someone photographed a train, bus, or village house, it is possible to see other photographs from the next village or train line taken on the same day by the same user. Svetlov Artem (talk) 16:23, 18 April 2024 (UTC)
- Support this bot's actions Юрий Д.К 01:56, 20 April 2024 (UTC)
NinoBot (talk · contribs)
Operator: Ignacio Rodríguez (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought:
- Handling and cleaning up the {{Book}} template on Spanish book files, for compatibility with Spanish Wikisource.
- Changing Author names in database form (LastName, FirstName, 1889-1934) into {{Creator}} templates when it applies.
- Simple tasks such as changing {{Description}} to {{Book}} on book files.
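The author-name conversion in the task list can be sketched as follows. This is an assumption-laden illustration rather than the operator's script: the function name is hypothetical, the input is assumed to follow the `LastName, FirstName, 1889-1934` pattern exactly, and the output uses the `{{Creator:Name}}` transclusion form without checking that such a Creator page exists (the "when it applies" part of the task).

```python
import re
from typing import Optional

# "LastName, FirstName" with an optional ", YYYY-YYYY" life span at the end.
AUTHOR_DB_FORM = re.compile(
    r"^\s*(?P<last>[^,]+),\s*(?P<first>[^,]+?)"
    r"(?:,\s*(?P<born>\d{4})-(?P<died>\d{4})?)?\s*$"
)


def to_creator_template(author: str) -> Optional[str]:
    """Return a {{Creator:First Last}} transclusion, or None if the form differs."""
    match = AUTHOR_DB_FORM.match(author)
    if match is None:
        return None
    first = match.group("first").strip()
    last = match.group("last").strip()
    return "{{Creator:%s %s}}" % (first, last)
```

Returning `None` for anything that does not match keeps unusual author strings out of automatic edits, so they can be handled manually.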
Automatic or manually assisted: Manually assisted
Edit type (e.g. Continuous, daily, one time run): Intermittently
Maximum edit rate (e.g. edits per minute): 10 edits per minute
Bot flag requested: (Y/N): Y
Programming language(s): pywikibot
Ignacio Rodríguez (talk) 16:36, 8 March 2024 (UTC)
- Discussion
- Please make test run. --EugeneZelenko (talk) 15:59, 9 March 2024 (UTC)
- @EugeneZelenko I conducted a series of tests, both here at Commons and on Wikidata. I finally opted for a less intrusive approach that doesn't modify the Commons page too much and instead just creates an entry on Wikidata and links it here. You can see the results here (File:Alma de niña. Irresponsable (1903).pdf) and here (d:Q124799320).
- Since the volume of edits won't be massive, and I would be doing this only occasionally, I'm not sure if I really need the bot flag. Still I will leave this request open for any remarks you might want to add. Regards, Ignacio Rodríguez (talk) 16:06, 9 March 2024 (UTC)
- So far I'm doing the bulk of the edits at Wikidata, but still one edit per book here (I'm doing this with my main account as they're script assisted edits, not automatic). I will edit the bot request to reflect another task I figured out: changing bare author references to {{Creator}} templates. Ignacio Rodríguez (talk) 06:17, 17 March 2024 (UTC)
- Revision #861215671: this is a prototypical edit. It moves parameters around as a side effect, and it may remove blank parameters it doesn't recognize, but filled parameters always stay the same. So, if it's mandatory to respect the parameter order, I will figure it out. Also, it doesn't deal with the blank spaces on either side of the equals sign (=). Ignacio Rodríguez (talk) 06:44, 17 March 2024 (UTC)
- If you are planning to do the edits with the bot account, please do the test with the bot account. If not, do you withdraw this request? Please advise. Krd 07:01, 18 April 2024 (UTC)
GeertivpBot (talk · contribs)
Operator: Geertivp (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought:
- Add missing SDC depict statements on media files (File namespace)
- Add missing Wikidata Infobox template to Category pages (Category namespace)
Automatic or manually assisted: Automatically, but monitored
Edit type (e.g. Continuous, daily, one time run): Intermittently
Maximum edit rate (e.g. edits per minute): 8 edits per minute
Bot flag requested: (Y/N): Y
Programming language(s): Pywikibot, Python scripts are on GitHub:
- https://github.com/geertivp/Pywikibot/blob/main/add_image_from_sdc.py
- https://github.com/geertivp/Pywikibot/blob/main/copy_label.py
Test runs are here.
Geert Van Pamel (talk) 22:29, 3 January 2024 (UTC)
- Discussion
- Could you please elaborate on how `depicts` is filled? For example, File:Novosibirsk Regional Museum at night 2.jpg should depict the building, and the shooting condition (night shot) should be a qualifier. --EugeneZelenko (talk) 15:40, 4 January 2024 (UTC)
- The image depicts a "night view of the Royal museum", expressed as SDC depicts (P180) nighttime view (Q28333482) with qualifier of (P642) City Trade House (Q19908752), based upon and generated by the original Wikidata statement Q19908752#P3451 City Trade House (Q19908752) nighttime view (P3451) (M19171168). By doing so, both the SDC depict statement in Wikimedia Commons and the Wikidata statement are describing the same fact.
- What qualifier would you use instead? Can you please elaborate more about the exact statement that you would create? Thanks. Geert Van Pamel (talk) 16:23, 4 January 2024 (UTC)
- Maybe you would like to see: depicts (P180) City Trade House (Q19908752) with qualifier depicted format (P7984) nighttime view (Q28333482)? Please give your point of view/preferences. Geert Van Pamel (talk) 14:24, 5 January 2024 (UTC)
- It makes sense to have a broader discussion on the matter of qualifiers. Maybe the bot should be limited to subjects only for now? --EugeneZelenko (talk) 15:35, 5 January 2024 (UTC)
- @Geertivp: ? --Krd 14:34, 26 January 2024 (UTC)
- Or we might generate two statements without qualifiers:
- depicts (P180) City Trade House (Q19908752)
- depicted format (P7984) nighttime view (Q28333482) Geert Van Pamel (talk) 19:43, 26 January 2024 (UTC)
- But here we have the problem that depicted format (P7984) may not be used as a qualifier: d:Property:P7984#P2302 => property scope constraint (Q53869507) as main value (Q54828448). In addition to that it can only be used with work of art (Q838948) entities and requires item-requires-statement constraint (Q21503247) genre (P136), which is in general not the case in this suggested usage. Which other qualifier property could be used instead? Geert Van Pamel (talk) 18:51, 18 February 2024 (UTC)
- I cannot follow. Please advise. --Krd 05:04, 7 March 2024 (UTC)
- @Geertivp: ? Krd 07:31, 31 March 2024 (UTC)
- In my previous reply I concluded that depicted format (P7984), because of its defined constraints, can't be used as a qualifier and only in some specific domains as work of art (Q838948). So we would need another more generic (currently not existing) property that could be used as a qualifier with a generic meaning of "type of representation", e.g. nighttime view (Q28333482) in the above example. Geert Van Pamel (talk) 20:53, 31 March 2024 (UTC)
- What does this mean regarding this request, what should be the conclusion? Krd 04:30, 3 April 2024 (UTC)
- We would need a new property "type of representation" that could be added as a qualifier to the depicts statements in Wikimedia Commons SDC for all images that are registered in Wikidata and don't have an image (P18) property in Wikidata. Geert Van Pamel (talk) 07:45, 3 April 2024 (UTC)
- Are you going to take care of property creation? Krd 09:09, 3 April 2024 (UTC)
- See d:wikidata:property proposal/Type of representation Geert Van Pamel (talk) 19:28, 4 April 2024 (UTC)
- Thank you. Please advise when the discussion is closed. Krd 07:02, 18 April 2024 (UTC)
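The statement shapes debated in this thread can be compared side by side in the Wikibase JSON layout. The sketch below only builds the data structures; it does not write anything to Commons and is not taken from the operator's scripts. The helper names `item_snak` and `statement` are hypothetical, and the property and item IDs are the ones quoted in the discussion.

```python
def item_snak(prop: str, qid: str) -> dict:
    """Build a value snak that points at a Wikidata item."""
    return {
        "snaktype": "value",
        "property": prop,
        "datavalue": {
            "type": "wikibase-entityid",
            "value": {"entity-type": "item", "numeric-id": int(qid[1:]), "id": qid},
        },
    }


def statement(main: dict, qualifiers: tuple = ()) -> dict:
    """Wrap a main snak (and optional qualifier snaks) into a statement."""
    claim = {"type": "statement", "rank": "normal", "mainsnak": main}
    if qualifiers:
        grouped: dict = {}
        for snak in qualifiers:
            # Qualifiers are grouped by property ID.
            grouped.setdefault(snak["property"], []).append(snak)
        claim["qualifiers"] = grouped
    return claim


# Shape the bot generated: depicts nighttime view, qualified with "of" the building.
generated = statement(
    item_snak("P180", "Q28333482"), (item_snak("P642", "Q19908752"),)
)

# Alternative raised in the thread: two plain statements without qualifiers.
alternative = [
    statement(item_snak("P180", "Q19908752")),
    statement(item_snak("P7984", "Q28333482")),
]
```

Laying the two variants out this way makes it easier to see that the open question is only where nighttime view (Q28333482) ends up: as the main value, as a qualifier value, or as a separate statement.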