MediaWiki:Titleblacklist: Difference between revisions

From Inkipedia, the Splatoon wiki
(Generic image file names)
(Improved regex to cover variants. The "farther" exception was moved to MediaWiki:Titlewhitelist as it should be.)
Line 42: Line 42:
  .*Massimo ?Chiacchio.*
  .*Massimo ?Chiacchio.*
  .*Hunter ?Mariner.*
  .*Hunter ?Mariner.*
  .*shit(?!ake).*
  .*shit.*
  .*cum(?! l|\w).*
  .*cum(?! l|\w).*
  .*nigg.*
  .*nigg.*
Line 65: Line 65:
  .{38,} <newaccountonly | errmsg=titleblacklist-username-character-limit> # Username limit of 32 characters. The regex to match 38 or more characters is used because "User:" is part of the title.
  .{38,} <newaccountonly | errmsg=titleblacklist-username-character-limit> # Username limit of 32 characters. The regex to match 38 or more characters is used because "User:" is part of the title.
  .*review.* <newaccountonly | errmsg=titleblacklist-username-spammer> # Recent spammers tend to have "Review" in their name
  .*review.* <newaccountonly | errmsg=titleblacklist-username-spammer> # Recent spammers tend to have "Review" in their name
  .*f[a4]r[t7](?!her).* <newaccountonly | errmsg=titleblacklist-username-spammer> # fart
  .*f ?[a4] ?r ?[t7].* <newaccountonly | errmsg=titleblacklist-username-spammer> # fart
  .*p[e3]{2}y[e3]w.* <newaccountonly | errmsg=titleblacklist-username-spammer> # peeyew
  .*p ?[e3] ?[e3] ?y ?[e3] ?w.* <newaccountonly | errmsg=titleblacklist-username-spammer> # peeyew
  .*p[o0]{2}p.* <newaccountonly | errmsg=titleblacklist-username-spammer> # poop
  .*p ?[o0] ?[o0] ?p.* <newaccountonly | errmsg=titleblacklist-username-spammer> # poop
.*pee yew.* <newaccountonly | errmsg=titleblacklist-username-spammer> #pee yew
  .*p [o0] [o0].* <newaccountonly | errmsg=titleblacklist-username-spammer> # p o o
.*f a r t.* <newaccountonly | errmsg=titleblacklist-username-spammer> #f a r t
  .*p e e y e w.* <newaccountonly | errmsg=titleblacklist-username-spammer> #p e e y e w
.*p o o.* <newaccountonly | errmsg=titleblacklist-username-spammer> #p o o


  # Obscure ASCII character lookalikes (copied from Wikipedia's title blacklist)
  # Obscure ASCII character lookalikes (copied from Wikipedia's title blacklist)

Revision as of 06:20, 25 April 2022

# This is a title blacklist. Titles and users that match a regular expression here cannot be created.
# Use "#" for comments.
# This is case insensitive by default
# Be specific to avoid the Scunthorpe Problem.
#
# Each entry may also contain optional attributes, enclosed in <> and separated with |
# * autoconfirmed - only autoconfirmed users are able to create/upload/move such pages
# * casesensitive - don't ignore case when checking title for being blacklisted
# * noedit - users are also unable to edit this article
# * moveonly - forbid moves but allow ordinary creation (rev:35163)
# * newaccountonly - forbid creation of matching usernames, but allow page creation (rev:38977)
# * reupload - allow reuploads of existing blacklisted files (rev:33656)
# * errmsg - the name of the message that should be displayed instead of standard
#
# Edit this page to view the blacklist.
# 

#

\? <newaccountonly>
\+ <newaccountonly>
% <newaccountonly>
& <newaccountonly>
@ <newaccountonly>
(  +)|\t <newaccountonly>
Insitblog[0-9]{2,3} <newaccountonly>
User\:Insitblog[0-9]{2,3}
.*invoice ?factoring.*
.*\bNu Skin\b.*
Cento[\ *]e\ *.*
.*\bgay\b.*
.*fuck.*
.*penis.*
.*JEWS DID .*
.*\bBITCH\b.*
.*\bCOCK\b.*
.*\bASS\b.*
.*\bGOATSE\b.*
.*smaller.than.average.*
.*have sex.*
.*\bis\s+an?\s+(?:dick|cunt|fag|bitch|shit|fuck|loser|ass|gay|ghey|moron|retard|stupid|slut|pa?edo).*
.*[Ll][Oo][Ll].*[Ww][Uu][Tt].*
.*\bnimp\.org.*
.*Massimo ?Chiacchio.*
.*Hunter ?Mariner.*
.*shit.*
.*cum(?! l|\w).*
.*nigg.*
.*p\.?h\.?o\.?n\.?e\.? ?n?\.?u\.?m.?b.?e.?r.*
.*h\.?e\.?l\.?p\.? ?n?\.?u\.?m.?b.?e.?r.*
.*h\.?e\.?l\.?p\.? ?l?\.?i\.?n.?e.*
.*c\.?u\.?s\.?t\.?o\.?m\.?e\.?r\.? ?c?\.?a\.?r.?e.*
.*c\.?u\.?s\.?t\.?o\.?m\.?e\.?r\.? ?s?\.?e\.?r.?v.?i.?c.?e.*
.*n ?e ?t ?g ?e ?a ?r.*
.*n ?o ?r ?t ?o ?n.*
.*Loans.*
.*male ?enhancement.*
.*quick ?books.*
.*\bgmail\b.*
.*\bhotmail\b.*
.*\bmicrosoft\b.*
.*\bskype\b.* <newaccountonly>
.*\bgoogle\b.*
.*\b\(?\d{3}\W?\s?\d{3}\W?\d{4}\b.*
# Harmless usernames that are favored by vandals
.{38,} <newaccountonly | errmsg=titleblacklist-username-character-limit> # Username limit of 32 characters. The regex to match 38 or more characters is used because "User:" is part of the title.
.*review.* <newaccountonly | errmsg=titleblacklist-username-spammer> # Recent spammers tend to have "Review" in their name
.*f ?[a4] ?r ?[t7].* <newaccountonly | errmsg=titleblacklist-username-spammer> # fart
.*p ?[e3] ?[e3] ?y ?[e3] ?w.* <newaccountonly | errmsg=titleblacklist-username-spammer> # peeyew
.*p ?[o0] ?[o0] ?p.* <newaccountonly | errmsg=titleblacklist-username-spammer> # poop
.*p [o0] [o0].* <newaccountonly | errmsg=titleblacklist-username-spammer> # p o o
# Obscure ASCII character lookalikes (copied from Wikipedia's title blacklist)
.*[ℂ℃℄ɕƌʥℇ℈℉ℊℋℌℍℎℏℐ‼ℑℒℕ℗℘ℙℚℛℜℝ℞℟℣ℤℨ℩ℬℭ℮ℯℰℱℲℳℴℹ℺⅁⅂⅃⅄ⅅⅆⅇⅈⅉⅎ].* <casesensitive> # Select Unicode Letterlike Symbols (excluding Kelvin, Angstrom and Ohm signs)
.*[\x{249C}-\x{24E9}].* <casesensitive> # Circled and parenthesized Latin letters
.*[\x{FF21}-\x{FF3A}\x{FF41}-\x{FF5A}].* <casesensitive> # Fullwidth Latin letters
.*[?ʖ؟ʔ].* <casesensitive> # Question mark lookalikes
(?!(User|Inkipedia)( talk)?:|Talk:).*[\x{1D00}-\x{1DBF}].* <casesensitive> # Phonetic extensions
.*[\x{0250}-\x{02AF}].* <casesensitive> # IPA extensions
.*[∀-∑∟-∳⊂-⋕⋲-⋿].* <casesensitive> # Select mathematical operators (excluding "−", "∞" and some other common ones)
.*[\x{27C0}-\x{27ef}\x{2980}-\x{29ff}\x{2a00}-\x{2aff}].* <casesensitive> # Misc./supplemental mathematical symbols
.*[ⱧȢƪƪҤҺԊѓԍҥҩԌЃϓғҒЊӷⱤɌɍᵲᶉɼᵃᵍᴱᵣᔮᓂᑫᓈᒦ٨٣ץױוזשלﬨטּפּבּڠɆɇᶒⱸȺⱥᶏ١].* <casesensitive> # Letter lookalikes
# Other undesirable characters (copied from Wikipedia's title blacklist)
.*[\x{00A0}\x{1680}\x{180E}\x{2000}-\x{200B}\x{2028}\x{2029}\x{202F}\x{205F}\x{3000}].* <casesensitive> # Non-breaking and other unusual spaces
.*[\x{202A}-\x{202E}].* <casesensitive> # BiDi overrides
.*[^\P{So}¦§©®°¶№™☀-⚲✁-➾`].* <casesensitive> # "Other punctuation", with some exceptions
.*\p{Cc}.* <casesensitive> # Control characters
.*\x{FEFF}.* <casesensitive> # Byte order mark
.*[卍卐࿕࿖࿗࿘☭☠🖕].* <casesensitive> # Swastikas, hammer-and-sickle, skull-and-crossbones, middle finger
.*\x{00AD}.* <casesensitive> # Soft-hyphen
.*[^\0-\x{FFFF}].* <casesensitive> # Very few characters outside the Basic Multilingual Plane are useful in titles
.*[\x{2400}-\x{2426}].* <casesensitive> # Graphic pictures for control codes
# Excessive punctuation or repetition (copied from Wikipedia's title blacklist)
.*[!?‽¿]{3}(?<!!!!).*
.*[!?‽¿]{2}(?<!!!).* <moveonly>
.*[!?‽¿]\s+[!?‽¿].*
.*‽‽.* <moveonly> 
.*¿¿.* <moveonly>
.*[\p{Z}]{2}.* # Disallows two adjacent "separator" characters (mostly funky spaces)
.*[^\p{L}\d ]{6}.* # Disallows six consecutive characters that are not letters (in any script), numbers, or spaces
.*([^0])\1{4}.* <moveonly> # Disallows four or more of the same character from page moves
.*\p{Lu}(\P{L}*\p{Lu}){9}.* <casesensitive | moveonly>  # Disallows moves with more than nine consecutive capital letters
(.*[^0-9'])?'{2,}.*|.*'{2,}([^'\-].*)? # Multiple consecutive apostrophes get interpreted as wiki markup, so shouldn't be used in page titles
# Generic image file names (copied from Wikipedia's title blacklist)
# at most three letters of potentially meaningful text:
File:\P{L}*((Ima?ge?|Pict?(ure)?|Media|Photo)\P{L}+)?(\p{L}\P{L}*){0,3}((orig|copy|thumb|small)\P{L}*)?\.[^.]+  <reupload | errmsg=titleblacklist-generic-filename>
# no more than two contiguous letters:
File:\P{L}*((Ima?ge?|Pict?(ure)?|Media|Photo)[^\p{L}\p{M}]\P{L}*)?((\p{L}\p{M}*){1,2}[^\p{L}\p{M}]\P{L}*)*((orig|copy|thumb|small)\P{L}*|(\p{L}\p{M}*){1,2})?\.[^.]+  <reupload | errmsg=titleblacklist-generic-filename>
# month name followed by no more than two contiguous letters, JPEG suffix:
File:\P{L}*(January|Jan|February|Febr?|March|Mar|April|Apr|May|June?|July?|August|Aug|September|Sept?|October|Oct|November|Nov|December|Dec)(\P{L}+\p{L}{1,2})*\P{L}*\.JPE?G  <reupload | errmsg=titleblacklist-generic-filename>
# Common digital cameral file names, based on list at http://diddly.com/random/about.html
# See also MediaWiki:Filename-prefix-blacklist, used to generate a warning on the upload form
File:DCP\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Kodak
File:DSC.\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # (Nikon, Fuji, Polaroid)
File:MVC-?\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Sony Mavica
File:P[\dA-F]\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Olympus, Kodak
File:I?MG[P_\s-]?\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Canon, Pentax
File:1\d+-\d+(_IMG)?\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Canon
File:(IM|EX)\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # HP Photosmart
File:DC\d+[SML]\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Kodak
File:P[Ii][Cc][T_\s-]?\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Minolta
File:PANA\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Panasonic
File:DUW\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # some mobile phones
File:CIMG\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Casio
File:JD\d+\.JPG  <reupload | errmsg=titleblacklist-generic-filename>  # Jenoptik
# Other common patterns
File:\d{9}[A-Z]{6}_[A-Z]{2}\P{L}*\.\w+  <reupload | errmsg=titleblacklist-generic-filename>  # some image hosting site?
File:\d+_[\da-f]+(_[a-z])?\P{L}*\.\w+  <reupload | errmsg=titleblacklist-generic-filename>  # Flickr; see http://www.flickr.com/services/api/misc.urls.html
# File:(\d{9,10}[A-Z])+[A-Z]?\.\w+  <reupload | errmsg=titleblacklist-generic-filename>  # yet another image hosting site? (redundant to "no more than two contiguous letters")
File:([\dA-F]{8}-)?[\dA-F]{4}-[\dA-F]{4}-[\dA-F]{4}-?[\dA-F]{12}.*  <reupload | errmsg=titleblacklist-generic-filename>  # UUID (with some variations included)
File:([SML]|\d+)_[\dA-F]{10,}(-\d+-|_?(\w\w?|full))?\.[^.]+  <reupload | errmsg=titleblacklist-generic-filename>  # L_9173c67eae58edc35ba7f2df08a7d5c6.jpg, 1_bf38bcd9c5512a5ab99ca2219a4b1e2f_full.gif, etc.
File:\P{L}*No\P{L}*name\P{L}*\.[^.]+  <reupload | errmsg=titleblacklist-generic-filename>  # Noname2.jpg
File:AT[AEIMQUYcgkosw048]AAA[A-D][-_A-Za-z0-9]+\.jpg  <reupload | errmsg=titleblacklist-generic-filename>
#Pict or Picture followed by numbers
File:Pict(ure)?[\s_-]?\d*\..* <reupload | errmsg=titleblacklist-generic-filename>
# Only numbers in the filename
File:\d+\.\w+ <reupload|errmsg=titleblacklist-generic-filename>
# Emoji
.*[©®‼⁉™ℹ↔↕↖↗↘↙↩↪⌚⌛⌨⏏⏩⏪⏫⏬⏭⏮⏯⏰⏱⏲⏳⏸⏹⏺Ⓜ▪▫▶◀◻◼◽◾☀☁☂☃☄☎☑☔☕☘☝☠☢☣☦☪☮☯☸☹☺♀♂♈♉♊♋♌♍♎♏♐♑♒♓♠♣♥♦♨♻♿⚒⚓⚔⚕⚖⚗⚙⚛⚜⚠⚡⚪⚫⚰⚱⚽⚾⛄⛅⛈⛎⛏⛑⛓⛔⛩⛪⛰⛱⛲⛳⛴⛵⛷⛸⛹⛺⛽✂✅✈✉✊✋✌✍✏✒✔✖✝✡✨✳✴❄❇❌❎❓❔❕❗❣❤➕➖➗➡➰➿⤴⤵⬅⬆⬇⬛⬜⭐⭕〰〽㊗㊙🀄🃏🅰🅱🅾🅿🆎🆑🆒🆓🆔🆕🆖🆗🆘🆙🆚🇦🇧🇨🇩🇪🇫🇬🇭🇮🇯🇰🇱🇲🇳🇴🇵🇶🇷🇸🇹🇺🇻🇼🇽🇾🇿🈁🈂🈚🈯🈲🈳🈴🈵🈶🈷🈸🈹🈺🉐🉑🌀🌁🌂🌃🌄🌅🌆🌇🌈🌉🌊🌋🌌🌍🌎🌏🌐🌑🌒🌓🌔🌕🌖🌗🌘🌙🌚🌛🌜🌝🌞🌟🌠🌡🌤🌥🌦🌧🌨🌩🌪🌫🌬🌭🌮🌯🌰🌱🌲🌳🌴🌵🌶🌷🌸🌹🌺🌻🌼🌽🌾🌿🍀🍁🍂🍃🍄🍅🍆🍇🍈🍉🍊🍋🍌🍍🍎🍏🍐🍑🍒🍓🍔🍕🍖🍗🍘🍙🍚🍛🍜🍝🍞🍟🍠🍡🍢🍣🍤🍥🍦🍧🍨🍩🍪🍫🍬🍭🍮🍯🍰🍱🍲🍳🍴🍵🍶🍷🍸🍹🍺🍻🍼🍽🍾🍿🎀🎁🎂🎃🎄🎅🎆🎇🎈🎉🎊🎋🎌🎍🎎🎏🎐🎑🎒🎓🎖🎗🎙🎚🎛🎞🎟🎠🎡🎢🎣🎤🎥🎦🎧🎨🎩🎪🎫🎬🎭🎮🎯🎰🎱🎲🎳🎴🎵🎶🎷🎸🎹🎺🎻🎼🎽🎾🎿🏀🏁🏂🏃🏄🏅🏆🏇🏈🏉🏊🏋🏌🏍🏎🏏🏐🏑🏒🏓🏔🏕🏖🏗🏘🏙🏚🏛🏜🏝🏞🏟🏠🏡🏢🏣🏤🏥🏦🏧🏨🏩🏪🏫🏬🏭🏮🏯🏰🏳🏴🏵🏷🏸🏹🏺🏻🏼🏽🏾🏿🐀🐁🐂🐃🐄🐅🐆🐇🐈🐉🐊🐋🐌🐍🐎🐏🐐🐑🐒🐓🐔🐕🐖🐗🐘🐙🐚🐛🐜🐝🐞🐟🐠🐡🐢🐣🐤🐥🐦🐧🐨🐩🐪🐫🐬🐭🐮🐯🐰🐱🐲🐳🐴🐵🐶🐷🐸🐹🐺🐻🐼🐽🐾🐿👀👁👂👃👄👅👆👇👈👉👊👋👌👍👎👏👐👑👒👓👔👕👖👗👘👙👚👛👜👝👞👟👠👡👢👣👤👥👦👧👨👩👪👫👬👭👮👯👰👱👲👳👴👵👶👷👸👹👺👻👼👽👾👿💀💁💂💃💄💅💆💇💈💉💊💋💌💍💎💏💐💑💒💓💔💕💖💗💘💙💚💛💜💝💞💟💠💡💢💣💤💥💦💧💨💩💪💫💬💭💮💯💰💱💲💳💴💵💶💷💸💹💺💻💼💽💾💿📀📁📂📃📄📅📆📇📈📉📊📋📌📍📎📏📐📑📒📓📔📕📖📗📘📙📚📛📜📝📞📟📠📡📢📣📤📥📦📧📨📩📪📫📬📭📮📯📰📱📲📳📴📵📶📷📸📹📺📻📼📽📿🔀🔁🔂🔃🔄🔅🔆🔇🔈🔉🔊🔋🔌🔍🔎🔏🔐🔑🔒🔓🔔🔕🔖🔗🔘🔙🔚🔛🔜🔝🔞🔟🔠🔡🔢🔣🔤🔥🔦🔧🔨🔩🔪🔫🔬🔭🔮🔯🔰🔱🔲🔳🔴🔵🔶🔷🔸🔹🔺🔻🔼🔽🕉🕊🕋🕌🕍🕎🕐🕑🕒🕓🕔🕕🕖🕗🕘🕙🕚🕛🕜🕝🕞🕟🕠🕡🕢🕣🕤🕥🕦🕧🕯🕰🕳🕴🕵🕶🕷🕸🕹🕺🖇🖊🖋🖌🖍🖐🖕🖖🖤🖥🖨🖱🖲🖼🗂🗃🗄🗑🗒🗓🗜🗝🗞🗡🗣🗨🗯🗳🗺🗻🗼🗽🗾🗿😀😁😂😃😄😅😆😇😈😉😊😋😌😍😎😏😐😑😒😓😔😕😖😗😘😙😚😛😜😝😞😟😠😡😢😣😤😥😦😧😨😩😪😫😬😭😮😯😰😱😲😳😴😵😶😷😸😹😺😻😼😽😾😿🙀🙁🙂🙃🙄🙅🙆🙇🙈🙉🙊🙋🙌🙍🙎🙏🚀🚁🚂🚃🚄🚅🚆🚇🚈🚉🚊🚋🚌🚍🚎🚏🚐🚑🚒🚓🚔🚕🚖🚗🚘🚙🚚🚛🚜🚝🚞🚟🚠🚡🚢🚣🚤🚥🚦🚧🚨🚩🚪🚫🚬🚭🚮🚯🚰🚱🚲🚳🚴🚵🚶🚷🚸🚹🚺🚻🚼🚽🚾🚿🛀🛁🛂🛃🛄🛅🛋🛌🛍🛎🛏🛐🛑🛒🛠🛡🛢🛣🛤🛥🛩🛫🛬🛰🛳🛴🛵🛶🤐🤑🤒🤓🤔🤕🤖🤗🤘🤙🤚🤛🤜🤝🤞🤠🤡🤢🤣🤤🤥🤦🤧🤰🤳🤴🤵🤶🤷🤸🤹🤺🤼🤽🤾🥀🥁🥂🥃🥄🥅🥇🥈🥉🥊🥋🥐🥑🥒🥓🥔🥕🥖🥗🥘🥙🥚🥛🥜🥝🥞🦀🦁🦂🦃🦄🦅🦆🦇🦈🦉🦊🦋🦌🦍🦎🦏🦐🦑🧀].* <autoconfirmed>
# Spam sites
.*(http|https|ftp|mailto|torrent|ed2k).*
.*(?:http|https|ftp|mailto|torrent|ed2k)\:\/\/[\w:@\-]+\.[\w\-]+.*
# IP user page (anti-spambot)
User:(?:(2([0-4]\d|5[0-5])|1\d\d|[1-9]?\d)\.){3}(?1)(\/.*)?
User:(?:([[:xdigit:] ]{1,4}):){7}(?1)(\/.*)?
#