MediaWiki:Titleblacklist

From Discovery Wiki
Revision as of 12:04, 9 April 2012 by Blodo (talk | contribs)
# This is a title blacklist. Titles and usernames that match a regular expression listed here cannot be created.
# Use "#" for comments.
# Matching is case-insensitive by default; entries marked <casesensitive> are matched case-sensitively.
# Blocking frequent bot registration names
User:[A-Z]{1}[a-z]+[A-Z]{1}[a-z]+[0-9]{3} <casesensitive|errmsg=titleblacklist-forbidden-new-account>
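# Illustrative reading of the entry above (these account names are hypothetical, not taken from any log):
#   "User:JohnSmith123"  matches: capital, lowercase run, capital, lowercase run, three digits
#   "User:Johnsmith123"  does not match: there is no second capitalised run
#   "User:JohnSmith12"   does not match: only two trailing digits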
# Block links in titles
.*(?:http|https|ftp|mailto|torrent|ed2k)\:\/\/[\w:@\-]+\.[\w\-]+.*
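# Illustrative reading of the entry above (hypothetical titles):
#   "Visit http://example.com today"  matches: scheme name, "://", then a dotted host name
#   "History of the FTP protocol"     does not match: no "://" follows the scheme name
#   "mailto:someone@example.com"      does not match either: the pattern requires "://" after every scheme, including mailto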
# OBSCURE ASCII CHARACTER LOOKALIKES
.*[คℂ℃℄ɕยี่ƌʥℇ℈℉ℊๆℋℌℍℎℏℐ‼ℑℒℕ℗℘ℙℚℛℜℝ℞℟๔℣ℤℨ℩ℬℭ℮ℯℰℱℲℳℴℹ℺⅁⅂⅃⅄ⅅⅆⅇⅈⅉⅎ].* <casesensitive> # Select Unicode Letterlike Symbols
.*[\x{249C}-\x{24E9}].* <casesensitive> # Circled and parenthesized Latin letters
.*[\x{FF21}-\x{FF3A}\x{FF41}-\x{FF5A}].* <casesensitive | errmsg=titleblacklist-custom-fullwidth> # Fullwidth Latin letters
.*[?ʖ؟ʔ].* <casesensitive | moveonly> # Question mark lookalikes, used for page move vandalism
.*[\x{1D00}-\x{1DBF}].* <casesensitive> # Phonetic extensions, almost never used in valid titles
.*[\x{0250}-\x{02AF}].* <casesensitive | moveonly> # IPA extensions, somewhat more common, so blocking only moves for now
.*[∀-∑∟-∳⊂-⋕⋲-⋿].* <casesensitive | moveonly> # Select mathematical operators (excluding "−", "∞" and some other common ones)
.*[\x{27C0}-\x{27ef}\x{2980}-\x{29ff}\x{2a00}-\x{2aff}].* <casesensitive | moveonly> # Misc./supplemental mathematical symbols
.*[ⱧȢƪƪҤҺԊѓԍҥҩԌЃϓғҒЊӷⱤɌɍᵲᶉɼᵃᵍᴱᵣᔮᓂᑫᓈᒦ٨٣ץױוזשלﬨטּפּבּڠɆɇᶒⱸȺⱥᶏ١].* <casesensitive | moveonly> # Letter lookalikes; none of these are currently used in any mainspace title
# OTHER UNDESIRABLE CHARACTERS
.*[\x{00A0}\x{1680}\x{180E}\x{2000}-\x{200B}\x{2028}\x{2029}\x{202F}\x{205F}\x{3000}].* <casesensitive | errmsg=titleblacklist-custom-nbsp> # Non-breaking and other unusual spaces, with custom error message
.*[\x{202A}-\x{202E}].* <casesensitive> # BiDi overrides
.*[^\P{So}¦§©®°¶№™☀-⚲✁-➾`].* <casesensitive> # "Other symbols" (Unicode category So), with some exceptions (may need more, this is a huge character class); note that single-character titles are permitted by the title whitelist
.*\p{Cc}.* <casesensitive> # Control characters
.*\x{FEFF}.* <casesensitive> # Byte order mark
.*[卍卐☭].* <casesensitive> # Swastikas, hammer-and-sickle
.*\x{00AD}.* <casesensitive> # Soft-hyphen
.*[^\0-\x{FFFF}].* <casesensitive> # Very few characters outside the Basic Multilingual Plane are useful in titles
.*[\x{2400}-\x{2426}].* <casesensitive> # Graphic pictures for control codes
# EXCESSIVE PUNCTUATION OR REPETITION
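.*[!?‽¿]{3}(?<!!!!).* # Disallows three consecutive "!", "?", "‽" or "¿" characters, except a run of "!!!"
.*[!?‽¿]{2}(?<!!!!).* <moveonly> # Disallows two consecutive such characters in page moves
.*[!?‽¿]\s+[!?‽¿].* # Disallows two such characters separated only by whitespace
.*‽‽.* <moveonly>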
.*¿¿.* <moveonly>
.*[\p{Z}]{2}.* # Disallows two adjacent "separator" characters (mostly funky spaces)
.*[^\p{L}\d ]{6}.* # Disallows six consecutive characters that are not letters (in any script), numbers, or spaces
.*([^0])\1{4}.* <moveonly> # Disallows five or more of the same character (other than "0") from page moves
.*(.)\1{10}.* <newaccountonly> # Disallows eleven or more of the same character repeated in usernames
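# Illustrative reading of the two entries above (hypothetical titles). A group followed by \1{n} needs n+1 identical characters in total:
#   ([^0])\1{4}  matches a move to "Hmmmmm" (five "m" in a row), but not to "Hmmmm" (four) or "100000" (zeros are excluded)
#   (.)\1{10}    matches a new account whose name contains eleven identical characters in a row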
.{40,} <newaccountonly> # Disallows very long names for new accounts (40 or more characters in the matched title)
.*\p{Lu}(\P{L}*\p{Lu}){9}.* <casesensitive | moveonly> # Disallows moves to titles with ten or more capital letters in a row, ignoring any non-letters between them
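# Illustrative reading of the entry above (hypothetical titles). It needs ten uppercase letters with no other letter between them:
#   "STOP THE PRESSES NOW"  matches: spaces are not letters, so the run of capitals continues across them
#   "NASA Launch Sites"     does not match: the lowercase "a" in "Launch" ends the run after five capitals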