1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980818283848586878889909192939495969798991001011021031041051061071081091101111121131141151161171181191201211221231241251261271281291301311321331341351361371381391401411421431441451461471481491501511521531541551561571581591601611621631641651661671681691701711721731741751761771781791801811821831841851861871881891901911921931941951961971981992002012022032042052062072082092102112122132142152162172182192202212222232242252262272282292302312322332342352362372382392402412422432442452462472482492502512522532542552562572582592602612622632642652662672682692702712722732742752762772782792802812822832842852862872882892902912922932942952962972982993003013023033043053063073083093103113123133143153163173183193203213223233243253263273283293303313323333343353363373383393403413423433443453463473483493503513523533543553563573583593603613623633643653663673683693703713723733743753763773783793803813823833843853863873883893903913923933943953963973983994004014024034044054064074084094104114124134144154164174184194204214224234244254264274284294304314324334344354364374384394404414424434444454464474484494504514524534544554564574584594604614624634644654664674684694704714724734744754764774784794804814824834844854864874884894904914924934944954964974984995005015025035045055065075085095105115125135145155165175185195205215225235245255265275285295305315325335345355365375385395405415425435445455465475485495505515525535545555565575585595605615625635645655665675685695705715725735745755765775785795805815825835845855865875885895905915925935945955965975985996006016026036046056066076086096106116126136146156166176186196206216226236246256266276286296306316326336346356366376386396406416426436446456466476486496506516526536546556566576586596606616626636646656666676686696706716726736746756766776786796806816826836846856866876886896906916926936946956966976986997007017027037047057067077087097107117127137147157167177187197207217227237247257267277287297307317327337347357367377387397407417427437447457467477487497507517527537547557567577587597607617627637647657667677687697707717727737747757767777787797807817827837847857867877887897907917927937947957967977987998008018028038048058068078088098108118128138148158168178188198208218228238248258268278288298308318328338348358368378388398408418428438448458468478488498508518528538548558568578588598608618628638648658668678688698708718728738748758768778788798808818828838848858868878888898908918928938948958968978988999009019029039049059069079089099109119129139149159169179189199209219229239249259269279289299309319329339349359369379389399409419429439449459469479489499509519529539549559569579589599609619629639649659669679689699709719729739749759769779789799809819829839849859869879889899909919929939949959969979989991000100110021003100410051006100710081009101010111012101310141015101610171018101910201021102210231024102510261027102810291030103110321033103410351036103710381039104010411042104310441045104610471048104910501051105210531054105510561057105810591060106110621063106410651066106710681069107010711072107310741075107610771078107910801081108210831084108510861087108810891090109110921093109410951096109710981099110011011102110311041105110611071108110911101111111211131114111511161117111811191120112111221123112411251126112711281129113011311132113311341135113611371138113911401141114211431144114511461147114811491150115111521153115411551156115711581159116011611162116311641165116611671168116911701171117211731174117511761177117811791180118111821183118411851186118711881189119011911192119311941195119611971198119912001201120212031204120512061207120812091210121112121213121412151216121712181219122012211222122312241225122612271228122912301231123212331234123512361237123812391240124112421243124412451246124712481249125012511252125312541255125612571258125912601261126212631264126512661267126812691270127112721273127412751276127712781279128012811282128312841285128612871288128912901291129212931294129512961297129812991300130113021303130413051306130713081309131013111312131313141315131613171318131913201321132213231324132513261327132813291330133113321333133413351336133713381339134013411342134313441345134613471348134913501351135213531354135513561357135813591360136113621363136413651366136713681369137013711372137313741375137613771378137913801381138213831384138513861387138813891390139113921393139413951396139713981399140014011402140314041405140614071408140914101411141214131414141514161417141814191420142114221423142414251426142714281429143014311432143314341435143614371438143914401441144214431444144514461447144814491450145114521453145414551456145714581459146014611462146314641465146614671468146914701471147214731474147514761477147814791480148114821483148414851486148714881489149014911492149314941495149614971498149915001501150215031504150515061507150815091510151115121513151415151516151715181519152015211522152315241525152615271528152915301531153215331534153515361537153815391540154115421543154415451546154715481549155015511552155315541555155615571558155915601561156215631564156515661567156815691570157115721573157415751576157715781579158015811582158315841585158615871588158915901591159215931594159515961597159815991600160116021603160416051606160716081609161016111612161316141615161616171618161916201621162216231624162516261627162816291630163116321633163416351636163716381639164016411642164316441645164616471648164916501651165216531654165516561657165816591660166116621663166416651666166716681669167016711672167316741675167616771678167916801681168216831684168516861687168816891690169116921693169416951696169716981699170017011702170317041705170617071708170917101711171217131714171517161717171817191720172117221723172417251726172717281729173017311732173317341735173617371738173917401741174217431744174517461747174817491750175117521753175417551756175717581759176017611762176317641765176617671768176917701771177217731774177517761777177817791780178117821783178417851786178717881789179017911792179317941795179617971798179918001801180218031804180518061807180818091810181118121813181418151816181718181819182018211822182318241825182618271828182918301831183218331834183518361837183818391840184118421843184418451846184718481849185018511852185318541855185618571858185918601861186218631864186518661867186818691870187118721873187418751876187718781879188018811882188318841885188618871888188918901891189218931894189518961897189818991900190119021903190419051906190719081909191019111912191319141915191619171918191919201921192219231924192519261927192819291930193119321933193419351936193719381939194019411942194319441945194619471948194919501951195219531954195519561957195819591960196119621963196419651966196719681969197019711972197319741975197619771978197919801981198219831984198519861987198819891990199119921993199419951996199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202420252026202720282029203020312032203320342035203620372038203920402041204220432044204520462047204820492050205120522053205420552056205720582059206020612062206320642065206620672068206920702071207220732074207520762077207820792080208120822083208420852086208720882089209020912092209320942095209620972098209921002101210221032104210521062107210821092110211121122113211421152116211721182119212021212122212321242125212621272128212921302131213221332134213521362137213821392140214121422143214421452146214721482149215021512152215321542155215621572158215921602161216221632164216521662167216821692170217121722173217421752176217721782179218021812182218321842185218621872188218921902191219221932194219521962197219821992200220122022203220422052206220722082209221022112212221322142215221622172218221922202221222222232224222522262227222822292230223122322233223422352236223722382239224022412242224322442245224622472248224922502251225222532254225522562257225822592260226122622263226422652266226722682269227022712272227322742275227622772278227922802281228222832284228522862287228822892290229122922293229422952296229722982299230023012302230323042305230623072308230923102311231223132314231523162317231823192320232123222323232423252326232723282329233023312332233323342335233623372338233923402341234223432344234523462347234823492350235123522353235423552356235723582359236023612362236323642365236623672368236923702371237223732374237523762377237823792380238123822383238423852386238723882389239023912392239323942395239623972398239924002401240224032404240524062407240824092410241124122413241424152416241724182419242024212422242324242425242624272428242924302431243224332434243524362437243824392440244124422443244424452446244724482449245024512452245324542455245624572458245924602461246224632464246524662467246824692470247124722473247424752476247724782479248024812482248324842485248624872488248924902491249224932494249524962497249824992500250125022503250425052506250725082509251025112512251325142515251625172518251925202521252225232524252525262527252825292530253125322533253425352536253725382539254025412542254325442545254625472548254925502551255225532554255525562557255825592560256125622563256425652566256725682569257025712572257325742575257625772578257925802581258225832584258525862587258825892590259125922593259425952596259725982599260026012602260326042605260626072608260926102611261226132614261526162617261826192620262126222623262426252626262726282629263026312632263326342635263626372638263926402641264226432644264526462647264826492650265126522653265426552656265726582659266026612662266326642665266626672668266926702671267226732674267526762677267826792680268126822683268426852686268726882689269026912692269326942695269626972698269927002701270227032704270527062707270827092710271127122713271427152716271727182719272027212722272327242725272627272728272927302731273227332734273527362737273827392740274127422743274427452746274727482749275027512752275327542755275627572758275927602761276227632764276527662767276827692770277127722773277427752776277727782779278027812782278327842785278627872788278927902791279227932794279527962797279827992800280128022803280428052806280728082809281028112812281328142815281628172818281928202821282228232824282528262827282828292830283128322833283428352836283728382839284028412842284328442845284628472848284928502851285228532854285528562857285828592860286128622863286428652866286728682869287028712872287328742875287628772878287928802881288228832884288528862887288828892890289128922893289428952896289728982899290029012902290329042905290629072908290929102911291229132914291529162917291829192920292129222923292429252926292729282929293029312932293329342935293629372938293929402941294229432944294529462947294829492950295129522953295429552956295729582959296029612962296329642965296629672968296929702971297229732974297529762977297829792980298129822983298429852986298729882989299029912992299329942995299629972998299930003001300230033004300530063007300830093010301130123013301430153016301730183019302030213022302330243025302630273028302930303031303230333034303530363037303830393040304130423043304430453046304730483049305030513052305330543055305630573058305930603061306230633064306530663067306830693070307130723073307430753076307730783079308030813082308330843085308630873088308930903091309230933094309530963097309830993100310131023103310431053106310731083109311031113112311331143115311631173118311931203121312231233124312531263127312831293130313131323133313431353136313731383139314031413142314331443145314631473148314931503151315231533154315531563157315831593160316131623163316431653166316731683169317031713172317331743175317631773178317931803181318231833184318531863187318831893190319131923193319431953196319731983199320032013202320332043205320632073208320932103211321232133214321532163217321832193220322132223223322432253226322732283229323032313232323332343235323632373238323932403241324232433244324532463247324832493250325132523253325432553256325732583259326032613262326332643265326632673268326932703271327232733274327532763277327832793280328132823283328432853286328732883289329032913292329332943295329632973298329933003301330233033304330533063307330833093310331133123313331433153316331733183319332033213322332333243325332633273328332933303331333233333334333533363337333833393340334133423343334433453346334733483349335033513352335333543355335633573358335933603361336233633364336533663367336833693370337133723373337433753376337733783379338033813382338333843385338633873388338933903391339233933394339533963397339833993400340134023403340434053406340734083409341034113412341334143415341634173418341934203421342234233424342534263427342834293430343134323433343434353436343734383439344034413442344334443445344634473448344934503451345234533454345534563457345834593460346134623463346434653466346734683469347034713472347334743475347634773478347934803481348234833484348534863487348834893490349134923493349434953496349734983499350035013502350335043505350635073508350935103511351235133514351535163517351835193520352135223523352435253526352735283529353035313532353335343535353635373538353935403541354235433544354535463547354835493550355135523553355435553556355735583559356035613562356335643565356635673568356935703571357235733574357535763577357835793580358135823583358435853586358735883589359035913592359335943595359635973598359936003601360236033604360536063607360836093610361136123613361436153616361736183619362036213622362336243625362636273628362936303631363236333634363536363637363836393640364136423643364436453646364736483649365036513652365336543655365636573658365936603661366236633664366536663667366836693670367136723673367436753676367736783679368036813682368336843685368636873688368936903691369236933694369536963697369836993700370137023703370437053706370737083709371037113712371337143715371637173718371937203721372237233724372537263727372837293730373137323733373437353736373737383739374037413742374337443745374637473748374937503751375237533754375537563757375837593760376137623763376437653766376737683769377037713772377337743775377637773778377937803781378237833784378537863787378837893790379137923793379437953796379737983799380038013802380338043805380638073808380938103811381238133814381538163817381838193820382138223823382438253826382738283829383038313832383338343835383638373838383938403841384238433844384538463847384838493850385138523853385438553856385738583859386038613862386338643865386638673868386938703871387238733874387538763877387838793880388138823883388438853886388738883889389038913892389338943895389638973898389939003901390239033904390539063907390839093910391139123913391439153916391739183919392039213922392339243925392639273928392939303931393239333934393539363937393839393940394139423943394439453946394739483949395039513952395339543955395639573958395939603961396239633964396539663967396839693970397139723973397439753976397739783979398039813982398339843985398639873988398939903991399239933994399539963997399839994000400140024003400440054006400740084009401040114012401340144015401640174018401940204021402240234024402540264027402840294030403140324033403440354036403740384039404040414042404340444045404640474048404940504051405240534054405540564057405840594060406140624063406440654066406740684069407040714072407340744075407640774078407940804081408240834084408540864087408840894090409140924093409440954096409740984099410041014102410341044105410641074108410941104111411241134114411541164117411841194120412141224123412441254126412741284129413041314132413341344135413641374138413941404141414241434144414541464147414841494150415141524153415441554156415741584159416041614162416341644165416641674168416941704171417241734174417541764177417841794180418141824183418441854186418741884189419041914192419341944195419641974198419942004201420242034204420542064207420842094210421142124213421442154216421742184219422042214222422342244225422642274228422942304231423242334234423542364237423842394240424142424243424442454246424742484249425042514252425342544255425642574258425942604261426242634264426542664267426842694270427142724273427442754276427742784279428042814282428342844285428642874288428942904291429242934294429542964297429842994300430143024303430443054306430743084309431043114312431343144315431643174318431943204321432243234324432543264327432843294330433143324333433443354336433743384339434043414342434343444345434643474348434943504351435243534354435543564357435843594360436143624363436443654366436743684369437043714372437343744375437643774378437943804381438243834384438543864387438843894390439143924393439443954396439743984399440044014402440344044405440644074408440944104411441244134414441544164417441844194420442144224423442444254426442744284429443044314432443344344435443644374438443944404441444244434444444544464447444844494450445144524453445444554456445744584459446044614462446344644465446644674468446944704471447244734474447544764477447844794480448144824483448444854486448744884489449044914492449344944495449644974498449945004501450245034504450545064507450845094510451145124513451445154516451745184519452045214522452345244525452645274528452945304531453245334534453545364537453845394540454145424543454445454546454745484549455045514552455345544555455645574558455945604561456245634564456545664567456845694570457145724573457445754576457745784579458045814582458345844585458645874588458945904591459245934594459545964597459845994600460146024603460446054606460746084609461046114612461346144615461646174618461946204621462246234624462546264627462846294630463146324633463446354636463746384639464046414642464346444645464646474648464946504651465246534654465546564657465846594660466146624663466446654666466746684669467046714672467346744675467646774678467946804681468246834684468546864687468846894690469146924693469446954696469746984699470047014702470347044705470647074708470947104711471247134714471547164717471847194720472147224723472447254726472747284729473047314732473347344735473647374738473947404741474247434744474547464747474847494750475147524753475447554756475747584759476047614762476347644765476647674768476947704771477247734774477547764777477847794780478147824783478447854786478747884789479047914792479347944795479647974798479948004801480248034804480548064807480848094810481148124813481448154816481748184819482048214822482348244825482648274828482948304831483248334834483548364837483848394840484148424843484448454846484748484849485048514852485348544855485648574858485948604861486248634864486548664867486848694870487148724873487448754876487748784879488048814882488348844885488648874888488948904891489248934894489548964897489848994900490149024903490449054906490749084909491049114912491349144915491649174918491949204921492249234924492549264927492849294930493149324933493449354936493749384939494049414942494349444945494649474948494949504951495249534954495549564957495849594960496149624963496449654966496749684969497049714972497349744975497649774978497949804981498249834984498549864987498849894990499149924993499449954996499749984999500050015002500350045005500650075008500950105011501250135014501550165017501850195020502150225023502450255026502750285029503050315032503350345035503650375038503950405041504250435044504550465047504850495050505150525053505450555056505750585059506050615062506350645065506650675068506950705071507250735074507550765077507850795080508150825083508450855086508750885089509050915092509350945095509650975098509951005101510251035104510551065107510851095110511151125113511451155116511751185119512051215122512351245125512651275128512951305131513251335134513551365137513851395140514151425143514451455146514751485149515051515152515351545155515651575158515951605161516251635164516551665167516851695170517151725173517451755176517751785179518051815182518351845185518651875188518951905191519251935194519551965197519851995200520152025203520452055206520752085209521052115212521352145215521652175218521952205221522252235224522552265227522852295230523152325233523452355236523752385239524052415242524352445245524652475248524952505251525252535254525552565257525852595260526152625263526452655266526752685269527052715272527352745275527652775278527952805281528252835284528552865287528852895290529152925293529452955296529752985299530053015302530353045305530653075308530953105311531253135314531553165317531853195320532153225323532453255326532753285329533053315332533353345335533653375338533953405341534253435344534553465347534853495350535153525353535453555356535753585359536053615362536353645365536653675368536953705371537253735374537553765377537853795380538153825383538453855386538753885389539053915392539353945395539653975398539954005401540254035404540554065407540854095410541154125413541454155416541754185419542054215422542354245425542654275428542954305431543254335434543554365437543854395440544154425443544454455446544754485449545054515452545354545455545654575458545954605461546254635464546554665467546854695470547154725473547454755476547754785479548054815482548354845485548654875488548954905491549254935494549554965497549854995500550155025503550455055506550755085509551055115512551355145515551655175518551955205521552255235524552555265527552855295530553155325533553455355536553755385539554055415542554355445545554655475548554955505551555255535554555555565557555855595560556155625563556455655566556755685569557055715572557355745575557655775578557955805581558255835584558555865587558855895590559155925593559455955596559755985599560056015602560356045605560656075608560956105611561256135614561556165617561856195620562156225623562456255626562756285629563056315632563356345635563656375638563956405641564256435644564556465647564856495650565156525653565456555656565756585659566056615662566356645665566656675668566956705671567256735674567556765677567856795680568156825683568456855686568756885689569056915692569356945695569656975698569957005701570257035704570557065707570857095710571157125713571457155716571757185719572057215722572357245725572657275728572957305731573257335734573557365737573857395740574157425743574457455746574757485749575057515752575357545755575657575758575957605761576257635764576557665767576857695770577157725773577457755776577757785779578057815782578357845785578657875788578957905791579257935794579557965797579857995800580158025803580458055806580758085809581058115812581358145815581658175818581958205821582258235824582558265827582858295830583158325833583458355836583758385839584058415842584358445845584658475848584958505851585258535854585558565857585858595860586158625863586458655866586758685869587058715872587358745875587658775878587958805881588258835884588558865887588858895890589158925893589458955896589758985899590059015902590359045905590659075908590959105911591259135914591559165917591859195920592159225923592459255926592759285929593059315932593359345935593659375938593959405941594259435944594559465947594859495950595159525953595459555956595759585959596059615962596359645965596659675968596959705971597259735974597559765977597859795980598159825983598459855986598759885989599059915992599359945995599659975998599960006001600260036004600560066007600860096010601160126013601460156016601760186019602060216022602360246025602660276028602960306031603260336034603560366037603860396040604160426043604460456046604760486049605060516052605360546055605660576058605960606061606260636064606560666067606860696070607160726073607460756076607760786079608060816082608360846085608660876088608960906091609260936094609560966097609860996100610161026103610461056106610761086109611061116112611361146115611661176118611961206121612261236124612561266127612861296130613161326133613461356136613761386139614061416142614361446145614661476148614961506151615261536154615561566157615861596160616161626163616461656166616761686169617061716172617361746175617661776178617961806181618261836184618561866187618861896190619161926193619461956196619761986199620062016202620362046205620662076208620962106211621262136214621562166217621862196220622162226223622462256226622762286229623062316232623362346235623662376238623962406241624262436244624562466247624862496250625162526253625462556256625762586259626062616262626362646265626662676268626962706271627262736274627562766277627862796280628162826283628462856286628762886289629062916292629362946295629662976298629963006301630263036304630563066307630863096310631163126313631463156316631763186319632063216322632363246325632663276328632963306331633263336334633563366337633863396340634163426343634463456346634763486349635063516352635363546355635663576358635963606361636263636364636563666367636863696370637163726373637463756376637763786379638063816382638363846385638663876388638963906391639263936394639563966397639863996400640164026403640464056406640764086409641064116412641364146415641664176418641964206421642264236424642564266427642864296430643164326433643464356436643764386439644064416442644364446445644664476448644964506451645264536454645564566457645864596460646164626463646464656466646764686469647064716472647364746475647664776478647964806481648264836484648564866487648864896490649164926493649464956496649764986499650065016502650365046505650665076508650965106511651265136514651565166517651865196520652165226523652465256526652765286529653065316532653365346535653665376538653965406541654265436544654565466547654865496550655165526553655465556556655765586559656065616562656365646565656665676568656965706571657265736574657565766577657865796580658165826583658465856586658765886589659065916592659365946595659665976598659966006601660266036604660566066607660866096610661166126613661466156616661766186619662066216622662366246625662666276628662966306631663266336634663566366637663866396640664166426643664466456646664766486649665066516652665366546655665666576658665966606661666266636664666566666667666866696670667166726673667466756676667766786679668066816682668366846685668666876688668966906691669266936694669566966697669866996700670167026703670467056706670767086709671067116712671367146715671667176718671967206721672267236724672567266727672867296730673167326733673467356736673767386739674067416742674367446745674667476748674967506751675267536754675567566757675867596760676167626763676467656766676767686769677067716772677367746775677667776778677967806781678267836784678567866787678867896790679167926793679467956796679767986799680068016802680368046805680668076808680968106811681268136814681568166817681868196820682168226823682468256826682768286829683068316832683368346835683668376838683968406841684268436844684568466847684868496850685168526853685468556856685768586859686068616862686368646865686668676868686968706871687268736874687568766877687868796880688168826883688468856886688768886889689068916892689368946895689668976898689969006901690269036904690569066907690869096910691169126913691469156916691769186919692069216922692369246925692669276928692969306931693269336934693569366937693869396940694169426943694469456946694769486949695069516952695369546955695669576958695969606961696269636964696569666967696869696970697169726973697469756976697769786979698069816982698369846985698669876988698969906991699269936994699569966997699869997000700170027003700470057006700770087009701070117012701370147015701670177018701970207021702270237024702570267027702870297030703170327033703470357036703770387039704070417042704370447045704670477048704970507051705270537054705570567057705870597060706170627063706470657066706770687069707070717072707370747075707670777078707970807081708270837084708570867087708870897090709170927093709470957096709770987099710071017102710371047105710671077108710971107111711271137114711571167117711871197120712171227123712471257126712771287129713071317132713371347135713671377138713971407141714271437144714571467147714871497150715171527153715471557156715771587159716071617162716371647165716671677168716971707171717271737174717571767177717871797180718171827183718471857186718771887189719071917192719371947195719671977198719972007201720272037204720572067207720872097210721172127213721472157216721772187219722072217222722372247225722672277228722972307231723272337234723572367237723872397240724172427243724472457246724772487249725072517252725372547255725672577258725972607261726272637264726572667267726872697270727172727273727472757276727772787279728072817282728372847285728672877288728972907291729272937294729572967297729872997300730173027303730473057306730773087309731073117312731373147315731673177318731973207321732273237324732573267327732873297330733173327333733473357336733773387339734073417342734373447345734673477348734973507351735273537354735573567357735873597360736173627363736473657366736773687369737073717372737373747375737673777378737973807381738273837384738573867387738873897390739173927393739473957396739773987399740074017402740374047405740674077408740974107411741274137414741574167417741874197420742174227423742474257426742774287429743074317432743374347435743674377438743974407441744274437444744574467447744874497450745174527453745474557456745774587459746074617462746374647465746674677468746974707471747274737474747574767477747874797480748174827483748474857486748774887489749074917492749374947495749674977498749975007501750275037504750575067507750875097510751175127513751475157516751775187519752075217522752375247525752675277528752975307531753275337534753575367537753875397540754175427543754475457546754775487549755075517552755375547555755675577558755975607561756275637564756575667567756875697570757175727573757475757576757775787579758075817582758375847585758675877588758975907591759275937594759575967597759875997600760176027603760476057606760776087609761076117612761376147615761676177618 |
- ---
- title: CommonMark Spec
- author: John MacFarlane
- version: 0.20
- date: 2015-06-08
- license: '[CC-BY-SA 4.0](http://creativecommons.org/licenses/by-sa/4.0/)'
- ...
- # Introduction
- ## What is Markdown?
- Markdown is a plain text format for writing structured documents,
- based on conventions used for indicating formatting in email and
- usenet posts. It was developed in 2004 by John Gruber, who wrote
- the first Markdown-to-HTML converter in perl, and it soon became
- widely used in websites. By 2014 there were dozens of
- implementations in many languages. Some of them extended basic
- Markdown syntax with conventions for footnotes, definition lists,
- tables, and other constructs, and some allowed output not just in
- HTML but in LaTeX and many other formats.
- ## Why is a spec needed?
- John Gruber's [canonical description of Markdown's
- syntax](http://daringfireball.net/projects/markdown/syntax)
- does not specify the syntax unambiguously. Here are some examples of
- questions it does not answer:
- 1. How much indentation is needed for a sublist? The spec says that
- continuation paragraphs need to be indented four spaces, but is
- not fully explicit about sublists. It is natural to think that
- they, too, must be indented four spaces, but `Markdown.pl` does
- not require that. This is hardly a "corner case," and divergences
- between implementations on this issue often lead to surprises for
- users in real documents. (See [this comment by John
- Gruber](http://article.gmane.org/gmane.text.markdown.general/1997).)
- 2. Is a blank line needed before a block quote or header?
- Most implementations do not require the blank line. However,
- this can lead to unexpected results in hard-wrapped text, and
- also to ambiguities in parsing (note that some implementations
- put the header inside the blockquote, while others do not).
- (John Gruber has also spoken [in favor of requiring the blank
- lines](http://article.gmane.org/gmane.text.markdown.general/2146).)
- 3. Is a blank line needed before an indented code block?
- (`Markdown.pl` requires it, but this is not mentioned in the
- documentation, and some implementations do not require it.)
- ``` markdown
- paragraph
- code?
- ```
- 4. What is the exact rule for determining when list items get
- wrapped in `<p>` tags? Can a list be partially "loose" and partially
- "tight"? What should we do with a list like this?
- ``` markdown
- 1. one
- 2. two
- 3. three
- ```
- Or this?
- ``` markdown
- 1. one
- - a
- - b
- 2. two
- ```
- (There are some relevant comments by John Gruber
- [here](http://article.gmane.org/gmane.text.markdown.general/2554).)
- 5. Can list markers be indented? Can ordered list markers be right-aligned?
- ``` markdown
- 8. item 1
- 9. item 2
- 10. item 2a
- ```
- 6. Is this one list with a horizontal rule in its second item,
- or two lists separated by a horizontal rule?
- ``` markdown
- * a
- * * * * *
- * b
- ```
- 7. When list markers change from numbers to bullets, do we have
- two lists or one? (The Markdown syntax description suggests two,
- but the perl scripts and many other implementations produce one.)
- ``` markdown
- 1. fee
- 2. fie
- - foe
- - fum
- ```
- 8. What are the precedence rules for the markers of inline structure?
- For example, is the following a valid link, or does the code span
- take precedence ?
- ``` markdown
- [a backtick (`)](/url) and [another backtick (`)](/url).
- ```
- 9. What are the precedence rules for markers of emphasis and strong
- emphasis? For example, how should the following be parsed?
- ``` markdown
- *foo *bar* baz*
- ```
- 10. What are the precedence rules between block-level and inline-level
- structure? For example, how should the following be parsed?
- ``` markdown
- - `a long code span can contain a hyphen like this
- - and it can screw things up`
- ```
- 11. Can list items include section headers? (`Markdown.pl` does not
- allow this, but does allow blockquotes to include headers.)
- ``` markdown
- - # Heading
- ```
- 12. Can list items be empty?
- ``` markdown
- * a
- *
- * b
- ```
- 13. Can link references be defined inside block quotes or list items?
- ``` markdown
- > Blockquote [foo].
- >
- > [foo]: /url
- ```
- 14. If there are multiple definitions for the same reference, which takes
- precedence?
- ``` markdown
- [foo]: /url1
- [foo]: /url2
- [foo][]
- ```
- In the absence of a spec, early implementers consulted `Markdown.pl`
- to resolve these ambiguities. But `Markdown.pl` was quite buggy, and
- gave manifestly bad results in many cases, so it was not a
- satisfactory replacement for a spec.
- Because there is no unambiguous spec, implementations have diverged
- considerably. As a result, users are often surprised to find that
- a document that renders one way on one system (say, a github wiki)
- renders differently on another (say, converting to docbook using
- pandoc). To make matters worse, because nothing in Markdown counts
- as a "syntax error," the divergence often isn't discovered right away.
- ## About this document
- This document attempts to specify Markdown syntax unambiguously.
- It contains many examples with side-by-side Markdown and
- HTML. These are intended to double as conformance tests. An
- accompanying script `spec_tests.py` can be used to run the tests
- against any Markdown program:
- python test/spec_tests.py --spec spec.txt --program PROGRAM
- Since this document describes how Markdown is to be parsed into
- an abstract syntax tree, it would have made sense to use an abstract
- representation of the syntax tree instead of HTML. But HTML is capable
- of representing the structural distinctions we need to make, and the
- choice of HTML for the tests makes it possible to run the tests against
- an implementation without writing an abstract syntax tree renderer.
- This document is generated from a text file, `spec.txt`, written
- in Markdown with a small extension for the side-by-side tests.
- The script `tools/makespec.py` can be used to convert `spec.txt` into
- HTML or CommonMark (which can then be converted into other formats).
- In the examples, the `→` character is used to represent tabs.
- # Preliminaries
- ## Characters and lines
- Any sequence of [character]s is a valid CommonMark
- document.
- A [character](@character) is a unicode code point.
- This spec does not specify an encoding; it thinks of lines as composed
- of characters rather than bytes. A conforming parser may be limited
- to a certain encoding.
- A [line](@line) is a sequence of zero or more [character]s
- followed by a [line ending] or by the end of file.
- A [line ending](@line-ending) is a newline (`U+000A`), carriage return
- (`U+000D`), or carriage return + newline.
- A line containing no characters, or a line containing only spaces
- (`U+0020`) or tabs (`U+0009`), is called a [blank line](@blank-line).
- The following definitions of character classes will be used in this spec:
- A [whitespace character](@whitespace-character) is a space
- (`U+0020`), tab (`U+0009`), newline (`U+000A`), line tabulation (`U+000B`),
- form feed (`U+000C`), or carriage return (`U+000D`).
- [Whitespace](@whitespace) is a sequence of one or more [whitespace
- character]s.
- A [unicode whitespace character](@unicode-whitespace-character) is
- any code point in the unicode `Zs` class, or a tab (`U+0009`),
- carriage return (`U+000D`), newline (`U+000A`), or form feed
- (`U+000C`).
- [Unicode whitespace](@unicode-whitespace) is a sequence of one
- or more [unicode whitespace character]s.
- A [space](@space) is `U+0020`.
- A [non-space character](@non-space-character) is any character
- that is not a [whitespace character].
- An [ASCII punctuation character](@ascii-punctuation-character)
- is `!`, `"`, `#`, `$`, `%`, `&`, `'`, `(`, `)`,
- `*`, `+`, `,`, `-`, `.`, `/`, `:`, `;`, `<`, `=`, `>`, `?`, `@`,
- `[`, `\`, `]`, `^`, `_`, `` ` ``, `{`, `|`, `}`, or `~`.
- A [punctuation character](@punctuation-character) is an [ASCII
- punctuation character] or anything in
- the unicode classes `Pc`, `Pd`, `Pe`, `Pf`, `Pi`, `Po`, or `Ps`.
- ## Preprocessing
- Tabs in lines are immediately expanded to [spaces][space], with a tab
- stop of 4 characters:
- .
- →foo→baz→→bim
- .
- <pre><code>foo baz bim
- </code></pre>
- .
- .
- a→a
- ὐ→a
- .
- <pre><code>a a
- ὐ a
- </code></pre>
- .
- ## Insecure characters
- For security reasons, the Unicode character `U+0000` must be replaced
- with the replacement character (`U+FFFD`).
- # Blocks and inlines
- We can think of a document as a sequence of
- [blocks](@block)---structural elements like paragraphs, block
- quotations, lists, headers, rules, and code blocks. Some blocks (like
- block quotes and list items) contain other blocks; others (like
- headers and paragraphs) contain [inline](@inline) content---text,
- links, emphasized text, images, code, and so on.
- ## Precedence
- Indicators of block structure always take precedence over indicators
- of inline structure. So, for example, the following is a list with
- two items, not a list with one item containing a code span:
- .
- - `one
- - two`
- .
- <ul>
- <li>`one</li>
- <li>two`</li>
- </ul>
- .
- This means that parsing can proceed in two steps: first, the block
- structure of the document can be discerned; second, text lines inside
- paragraphs, headers, and other block constructs can be parsed for inline
- structure. The second step requires information about link reference
- definitions that will be available only at the end of the first
- step. Note that the first step requires processing lines in sequence,
- but the second can be parallelized, since the inline parsing of
- one block element does not affect the inline parsing of any other.
- ## Container blocks and leaf blocks
- We can divide blocks into two types:
- [container block](@container-block)s,
- which can contain other blocks, and [leaf block](@leaf-block)s,
- which cannot.
- # Leaf blocks
- This section describes the different kinds of leaf block that make up a
- Markdown document.
- ## Horizontal rules
- A line consisting of 0-3 spaces of indentation, followed by a sequence
- of three or more matching `-`, `_`, or `*` characters, each followed
- optionally by any number of spaces, forms a
- [horizontal rule](@horizontal-rule).
- .
- ***
- ---
- ___
- .
- <hr />
- <hr />
- <hr />
- .
- Wrong characters:
- .
- +++
- .
- <p>+++</p>
- .
- .
- ===
- .
- <p>===</p>
- .
- Not enough characters:
- .
- --
- **
- __
- .
- <p>--
- **
- __</p>
- .
- One to three spaces indent are allowed:
- .
- ***
- ***
- ***
- .
- <hr />
- <hr />
- <hr />
- .
- Four spaces is too many:
- .
- ***
- .
- <pre><code>***
- </code></pre>
- .
- .
- Foo
- ***
- .
- <p>Foo
- ***</p>
- .
- More than three characters may be used:
- .
- _____________________________________
- .
- <hr />
- .
- Spaces are allowed between the characters:
- .
- - - -
- .
- <hr />
- .
- .
- ** * ** * ** * **
- .
- <hr />
- .
- .
- - - - -
- .
- <hr />
- .
- Spaces are allowed at the end:
- .
- - - - -
- .
- <hr />
- .
- However, no other characters may occur in the line:
- .
- _ _ _ _ a
- a------
- ---a---
- .
- <p>_ _ _ _ a</p>
- <p>a------</p>
- <p>---a---</p>
- .
- It is required that all of the [non-space character]s be the same.
- So, this is not a horizontal rule:
- .
- *-*
- .
- <p><em>-</em></p>
- .
- Horizontal rules do not need blank lines before or after:
- .
- - foo
- ***
- - bar
- .
- <ul>
- <li>foo</li>
- </ul>
- <hr />
- <ul>
- <li>bar</li>
- </ul>
- .
- Horizontal rules can interrupt a paragraph:
- .
- Foo
- ***
- bar
- .
- <p>Foo</p>
- <hr />
- <p>bar</p>
- .
- If a line of dashes that meets the above conditions for being a
- horizontal rule could also be interpreted as the underline of a [setext
- header], the interpretation as a
- [setext header] takes precedence. Thus, for example,
- this is a setext header, not a paragraph followed by a horizontal rule:
- .
- Foo
- ---
- bar
- .
- <h2>Foo</h2>
- <p>bar</p>
- .
- When both a horizontal rule and a list item are possible
- interpretations of a line, the horizontal rule takes precedence:
- .
- * Foo
- * * *
- * Bar
- .
- <ul>
- <li>Foo</li>
- </ul>
- <hr />
- <ul>
- <li>Bar</li>
- </ul>
- .
- If you want a horizontal rule in a list item, use a different bullet:
- .
- - Foo
- - * * *
- .
- <ul>
- <li>Foo</li>
- <li>
- <hr />
- </li>
- </ul>
- .
- ## ATX headers
- An [ATX header](@atx-header)
- consists of a string of characters, parsed as inline content, between an
- opening sequence of 1--6 unescaped `#` characters and an optional
- closing sequence of any number of `#` characters. The opening sequence
- of `#` characters cannot be followed directly by a
- [non-space character]. The optional closing sequence of `#`s must be
- preceded by a [space] and may be followed by spaces only. The opening
- `#` character may be indented 0-3 spaces. The raw contents of the
- header are stripped of leading and trailing spaces before being parsed
- as inline content. The header level is equal to the number of `#`
- characters in the opening sequence.
- Simple headers:
- .
- # foo
- ## foo
- ### foo
- #### foo
- ##### foo
- ###### foo
- .
- <h1>foo</h1>
- <h2>foo</h2>
- <h3>foo</h3>
- <h4>foo</h4>
- <h5>foo</h5>
- <h6>foo</h6>
- .
- More than six `#` characters is not a header:
- .
- ####### foo
- .
- <p>####### foo</p>
- .
- At least one space is required between the `#` characters and the
- header's contents, unless the header is empty. Note that many
- implementations currently do not require the space. However, the
- space was required by the
- [original ATX implementation](http://www.aaronsw.com/2002/atx/atx.py),
- and it helps prevent things like the following from being parsed as
- headers:
- .
- #5 bolt
- #foobar
- .
- <p>#5 bolt</p>
- <p>#foobar</p>
- .
- This is not a header, because the first `#` is escaped:
- .
- \## foo
- .
- <p>## foo</p>
- .
- Contents are parsed as inlines:
- .
- # foo *bar* \*baz\*
- .
- <h1>foo <em>bar</em> *baz*</h1>
- .
- Leading and trailing blanks are ignored in parsing inline content:
- .
- # foo
- .
- <h1>foo</h1>
- .
- One to three spaces indentation are allowed:
- .
- ### foo
- ## foo
- # foo
- .
- <h3>foo</h3>
- <h2>foo</h2>
- <h1>foo</h1>
- .
- Four spaces are too much:
- .
- # foo
- .
- <pre><code># foo
- </code></pre>
- .
- .
- foo
- # bar
- .
- <p>foo
- # bar</p>
- .
- A closing sequence of `#` characters is optional:
- .
- ## foo ##
- ### bar ###
- .
- <h2>foo</h2>
- <h3>bar</h3>
- .
- It need not be the same length as the opening sequence:
- .
- # foo ##################################
- ##### foo ##
- .
- <h1>foo</h1>
- <h5>foo</h5>
- .
- Spaces are allowed after the closing sequence:
- .
- ### foo ###
- .
- <h3>foo</h3>
- .
- A sequence of `#` characters with a
- [non-space character] following it
- is not a closing sequence, but counts as part of the contents of the
- header:
- .
- ### foo ### b
- .
- <h3>foo ### b</h3>
- .
- The closing sequence must be preceded by a space:
- .
- # foo#
- .
- <h1>foo#</h1>
- .
- Backslash-escaped `#` characters do not count as part
- of the closing sequence:
- .
- ### foo \###
- ## foo #\##
- # foo \#
- .
- <h3>foo ###</h3>
- <h2>foo ###</h2>
- <h1>foo #</h1>
- .
- ATX headers need not be separated from surrounding content by blank
- lines, and they can interrupt paragraphs:
- .
- ****
- ## foo
- ****
- .
- <hr />
- <h2>foo</h2>
- <hr />
- .
- .
- Foo bar
- # baz
- Bar foo
- .
- <p>Foo bar</p>
- <h1>baz</h1>
- <p>Bar foo</p>
- .
- ATX headers can be empty:
- .
- ##
- #
- ### ###
- .
- <h2></h2>
- <h1></h1>
- <h3></h3>
- .
- ## Setext headers
- A [setext header](@setext-header)
- consists of a line of text, containing at least one [non-space character],
- with no more than 3 spaces indentation, followed by a [setext header
- underline]. The line of text must be
- one that, were it not followed by the setext header underline,
- would be interpreted as part of a paragraph: it cannot be
- interpretable as a [code fence], [ATX header][ATX headers],
- [block quote][block quotes], [horizontal rule][horizontal rules],
- [list item][list items], or [HTML block][HTML blocks].
- A [setext header underline](@setext-header-underline) is a sequence of
- `=` characters or a sequence of `-` characters, with no more than 3
- spaces indentation and any number of trailing spaces. If a line
- containing a single `-` can be interpreted as an
- empty [list items], it should be interpreted this way
- and not as a [setext header underline].
- The header is a level 1 header if `=` characters are used in the
- [setext header underline], and a level 2
- header if `-` characters are used. The contents of the header are the
- result of parsing the first line as Markdown inline content.
- In general, a setext header need not be preceded or followed by a
- blank line. However, it cannot interrupt a paragraph, so when a
- setext header comes after a paragraph, a blank line is needed between
- them.
- Simple examples:
- .
- Foo *bar*
- =========
- Foo *bar*
- ---------
- .
- <h1>Foo <em>bar</em></h1>
- <h2>Foo <em>bar</em></h2>
- .
- The underlining can be any length:
- .
- Foo
- -------------------------
- Foo
- =
- .
- <h2>Foo</h2>
- <h1>Foo</h1>
- .
- The header content can be indented up to three spaces, and need
- not line up with the underlining:
- .
- Foo
- ---
- Foo
- -----
- Foo
- ===
- .
- <h2>Foo</h2>
- <h2>Foo</h2>
- <h1>Foo</h1>
- .
- Four spaces indent is too much:
- .
- Foo
- ---
- Foo
- ---
- .
- <pre><code>Foo
- ---
- Foo
- </code></pre>
- <hr />
- .
- The setext header underline can be indented up to three spaces, and
- may have trailing spaces:
- .
- Foo
- ----
- .
- <h2>Foo</h2>
- .
- Four spaces is too much:
- .
- Foo
- ---
- .
- <p>Foo
- ---</p>
- .
- The setext header underline cannot contain internal spaces:
- .
- Foo
- = =
- Foo
- --- -
- .
- <p>Foo
- = =</p>
- <p>Foo</p>
- <hr />
- .
- Trailing spaces in the content line do not cause a line break:
- .
- Foo
- -----
- .
- <h2>Foo</h2>
- .
- Nor does a backslash at the end:
- .
- Foo\
- ----
- .
- <h2>Foo\</h2>
- .
- Since indicators of block structure take precedence over
- indicators of inline structure, the following are setext headers:
- .
- `Foo
- ----
- `
- <a title="a lot
- ---
- of dashes"/>
- .
- <h2>`Foo</h2>
- <p>`</p>
- <h2><a title="a lot</h2>
- <p>of dashes"/></p>
- .
- The setext header underline cannot be a [lazy continuation
- line] in a list item or block quote:
- .
- > Foo
- ---
- .
- <blockquote>
- <p>Foo</p>
- </blockquote>
- <hr />
- .
- .
- - Foo
- ---
- .
- <ul>
- <li>Foo</li>
- </ul>
- <hr />
- .
- A setext header cannot interrupt a paragraph:
- .
- Foo
- Bar
- ---
- Foo
- Bar
- ===
- .
- <p>Foo
- Bar</p>
- <hr />
- <p>Foo
- Bar
- ===</p>
- .
- But in general a blank line is not required before or after:
- .
- ---
- Foo
- ---
- Bar
- ---
- Baz
- .
- <hr />
- <h2>Foo</h2>
- <h2>Bar</h2>
- <p>Baz</p>
- .
- Setext headers cannot be empty:
- .
- ====
- .
- <p>====</p>
- .
- Setext header text lines must not be interpretable as block
- constructs other than paragraphs. So, the line of dashes
- in these examples gets interpreted as a horizontal rule:
- .
- ---
- ---
- .
- <hr />
- <hr />
- .
- .
- - foo
- -----
- .
- <ul>
- <li>foo</li>
- </ul>
- <hr />
- .
- .
- foo
- ---
- .
- <pre><code>foo
- </code></pre>
- <hr />
- .
- .
- > foo
- -----
- .
- <blockquote>
- <p>foo</p>
- </blockquote>
- <hr />
- .
- If you want a header with `> foo` as its literal text, you can
- use backslash escapes:
- .
- \> foo
- ------
- .
- <h2>> foo</h2>
- .
- ## Indented code blocks
- An [indented code block](@indented-code-block) is composed of one or more
- [indented chunk]s separated by blank lines.
- An [indented chunk](@indented-chunk) is a sequence of non-blank lines,
- each indented four or more spaces. The contents of the code block are
- the literal contents of the lines, including trailing
- [line ending]s, minus four spaces of indentation.
- An indented code block has no [info string].
- An indented code block cannot interrupt a paragraph, so there must be
- a blank line between a paragraph and a following indented code block.
- (A blank line is not needed, however, between a code block and a following
- paragraph.)
- .
- a simple
- indented code block
- .
- <pre><code>a simple
- indented code block
- </code></pre>
- .
- If there is any ambiguity between an interpretation of indentation
- as a code block and as indicating that material belongs to a [list
- item][list items], the list item interpretation takes precedence:
- .
- - foo
- bar
- .
- <ul>
- <li>
- <p>foo</p>
- <p>bar</p>
- </li>
- </ul>
- .
- .
- 1. foo
- - bar
- .
- <ol>
- <li>
- <p>foo</p>
- <ul>
- <li>bar</li>
- </ul>
- </li>
- </ol>
- .
- The contents of a code block are literal text, and do not get parsed
- as Markdown:
- .
- <a/>
- *hi*
- - one
- .
- <pre><code><a/>
- *hi*
- - one
- </code></pre>
- .
- Here we have three chunks separated by blank lines:
- .
- chunk1
- chunk2
-
-
-
- chunk3
- .
- <pre><code>chunk1
- chunk2
- chunk3
- </code></pre>
- .
- Any initial spaces beyond four will be included in the content, even
- in interior blank lines:
- .
- chunk1
-
- chunk2
- .
- <pre><code>chunk1
-
- chunk2
- </code></pre>
- .
- An indented code block cannot interrupt a paragraph. (This
- allows hanging indents and the like.)
- .
- Foo
- bar
- .
- <p>Foo
- bar</p>
- .
- However, any non-blank line with fewer than four leading spaces ends
- the code block immediately. So a paragraph may occur immediately
- after indented code:
- .
- foo
- bar
- .
- <pre><code>foo
- </code></pre>
- <p>bar</p>
- .
- And indented code can occur immediately before and after other kinds of
- blocks:
- .
- # Header
- foo
- Header
- ------
- foo
- ----
- .
- <h1>Header</h1>
- <pre><code>foo
- </code></pre>
- <h2>Header</h2>
- <pre><code>foo
- </code></pre>
- <hr />
- .
- The first line can be indented more than four spaces:
- .
- foo
- bar
- .
- <pre><code> foo
- bar
- </code></pre>
- .
- Blank lines preceding or following an indented code block
- are not included in it:
- .
-
- foo
-
- .
- <pre><code>foo
- </code></pre>
- .
- Trailing spaces are included in the code block's content:
- .
- foo
- .
- <pre><code>foo
- </code></pre>
- .
- ## Fenced code blocks
- A [code fence](@code-fence) is a sequence
- of at least three consecutive backtick characters (`` ` ``) or
- tildes (`~`). (Tildes and backticks cannot be mixed.)
- A [fenced code block](@fenced-code-block)
- begins with a code fence, indented no more than three spaces.
- The line with the opening code fence may optionally contain some text
- following the code fence; this is trimmed of leading and trailing
- spaces and called the [info string](@info-string).
- The [info string] may not contain any backtick
- characters. (The reason for this restriction is that otherwise
- some inline code would be incorrectly interpreted as the
- beginning of a fenced code block.)
- The content of the code block consists of all subsequent lines, until
- a closing [code fence] of the same type as the code block
- began with (backticks or tildes), and with at least as many backticks
- or tildes as the opening code fence. If the leading code fence is
- indented N spaces, then up to N spaces of indentation are removed from
- each line of the content (if present). (If a content line is not
- indented, it is preserved unchanged. If it is indented less than N
- spaces, all of the indentation is removed.)
- The closing code fence may be indented up to three spaces, and may be
- followed only by spaces, which are ignored. If the end of the
- containing block (or document) is reached and no closing code fence
- has been found, the code block contains all of the lines after the
- opening code fence until the end of the containing block (or
- document). (An alternative spec would require backtracking in the
- event that a closing code fence is not found. But this makes parsing
- much less efficient, and there seems to be no real down side to the
- behavior described here.)
- A fenced code block may interrupt a paragraph, and does not require
- a blank line either before or after.
- The content of a code fence is treated as literal text, not parsed
- as inlines. The first word of the [info string] is typically used to
- specify the language of the code sample, and rendered in the `class`
- attribute of the `code` tag. However, this spec does not mandate any
- particular treatment of the [info string].
- Here is a simple example with backticks:
- .
- ```
- <
- >
- ```
- .
- <pre><code><
- >
- </code></pre>
- .
- With tildes:
- .
- ~~~
- <
- >
- ~~~
- .
- <pre><code><
- >
- </code></pre>
- .
- The closing code fence must use the same character as the opening
- fence:
- .
- ```
- aaa
- ~~~
- ```
- .
- <pre><code>aaa
- ~~~
- </code></pre>
- .
- .
- ~~~
- aaa
- ```
- ~~~
- .
- <pre><code>aaa
- ```
- </code></pre>
- .
- The closing code fence must be at least as long as the opening fence:
- .
- ````
- aaa
- ```
- ``````
- .
- <pre><code>aaa
- ```
- </code></pre>
- .
- .
- ~~~~
- aaa
- ~~~
- ~~~~
- .
- <pre><code>aaa
- ~~~
- </code></pre>
- .
- Unclosed code blocks are closed by the end of the document:
- .
- ```
- .
- <pre><code></code></pre>
- .
- .
- `````
- ```
- aaa
- .
- <pre><code>
- ```
- aaa
- </code></pre>
- .
- A code block can have all empty lines as its content:
- .
- ```
-
- ```
- .
- <pre><code>
-
- </code></pre>
- .
- A code block can be empty:
- .
- ```
- ```
- .
- <pre><code></code></pre>
- .
- Fences can be indented. If the opening fence is indented,
- content lines will have equivalent opening indentation removed,
- if present:
- .
- ```
- aaa
- aaa
- ```
- .
- <pre><code>aaa
- aaa
- </code></pre>
- .
- .
- ```
- aaa
- aaa
- aaa
- ```
- .
- <pre><code>aaa
- aaa
- aaa
- </code></pre>
- .
- .
- ```
- aaa
- aaa
- aaa
- ```
- .
- <pre><code>aaa
- aaa
- aaa
- </code></pre>
- .
- Four spaces indentation produces an indented code block:
- .
- ```
- aaa
- ```
- .
- <pre><code>```
- aaa
- ```
- </code></pre>
- .
- Closing fences may be indented by 0-3 spaces, and their indentation
- need not match that of the opening fence:
- .
- ```
- aaa
- ```
- .
- <pre><code>aaa
- </code></pre>
- .
- .
- ```
- aaa
- ```
- .
- <pre><code>aaa
- </code></pre>
- .
- This is not a closing fence, because it is indented 4 spaces:
- .
- ```
- aaa
- ```
- .
- <pre><code>aaa
- ```
- </code></pre>
- .
- Code fences (opening and closing) cannot contain internal spaces:
- .
- ``` ```
- aaa
- .
- <p><code></code>
- aaa</p>
- .
- .
- ~~~~~~
- aaa
- ~~~ ~~
- .
- <pre><code>aaa
- ~~~ ~~
- </code></pre>
- .
- Fenced code blocks can interrupt paragraphs, and can be followed
- directly by paragraphs, without a blank line between:
- .
- foo
- ```
- bar
- ```
- baz
- .
- <p>foo</p>
- <pre><code>bar
- </code></pre>
- <p>baz</p>
- .
- Other blocks can also occur before and after fenced code blocks
- without an intervening blank line:
- .
- foo
- ---
- ~~~
- bar
- ~~~
- # baz
- .
- <h2>foo</h2>
- <pre><code>bar
- </code></pre>
- <h1>baz</h1>
- .
- An [info string] can be provided after the opening code fence.
- Opening and closing spaces will be stripped, and the first word, prefixed
- with `language-`, is used as the value for the `class` attribute of the
- `code` element within the enclosing `pre` element.
- .
- ```ruby
- def foo(x)
- return 3
- end
- ```
- .
- <pre><code class="language-ruby">def foo(x)
- return 3
- end
- </code></pre>
- .
- .
- ~~~~ ruby startline=3 $%@#$
- def foo(x)
- return 3
- end
- ~~~~~~~
- .
- <pre><code class="language-ruby">def foo(x)
- return 3
- end
- </code></pre>
- .
- .
- ````;
- ````
- .
- <pre><code class="language-;"></code></pre>
- .
- [Info string]s for backtick code blocks cannot contain backticks:
- .
- ``` aa ```
- foo
- .
- <p><code>aa</code>
- foo</p>
- .
- Closing code fences cannot have [info string]s:
- .
- ```
- ``` aaa
- ```
- .
- <pre><code>``` aaa
- </code></pre>
- .
- ## HTML blocks
- An [HTML block tag](@html-block-tag) is
- an [open tag] or [closing tag] whose tag
- name is one of the following (case-insensitive):
- `article`, `header`, `aside`, `hgroup`, `blockquote`, `hr`, `iframe`,
- `body`, `li`, `map`, `button`, `object`, `canvas`, `ol`, `caption`,
- `output`, `col`, `p`, `colgroup`, `pre`, `dd`, `progress`, `div`,
- `section`, `dl`, `table`, `td`, `dt`, `tbody`, `embed`, `textarea`,
- `fieldset`, `tfoot`, `figcaption`, `th`, `figure`, `thead`, `footer`,
- `tr`, `form`, `ul`, `h1`, `h2`, `h3`, `h4`, `h5`, `h6`, `video`,
- `script`, `style`.
- An [HTML block](@html-block) begins with an
- [HTML block tag], [HTML comment], [processing instruction],
- [declaration], or [CDATA section].
- It ends when a [blank line] or the end of the
- input is encountered. The initial line may be indented up to three
- spaces, and subsequent lines may have any indentation. The contents
- of the HTML block are interpreted as raw HTML, and will not be escaped
- in HTML output.
- Some simple examples:
- .
- <table>
- <tr>
- <td>
- hi
- </td>
- </tr>
- </table>
- okay.
- .
- <table>
- <tr>
- <td>
- hi
- </td>
- </tr>
- </table>
- <p>okay.</p>
- .
- .
- <div>
- *hello*
- <foo><a>
- .
- <div>
- *hello*
- <foo><a>
- .
- Here we have two HTML blocks with a Markdown paragraph between them:
- .
- <DIV CLASS="foo">
- *Markdown*
- </DIV>
- .
- <DIV CLASS="foo">
- <p><em>Markdown</em></p>
- </DIV>
- .
- In the following example, what looks like a Markdown code block
- is actually part of the HTML block, which continues until a blank
- line or the end of the document is reached:
- .
- <div></div>
- ``` c
- int x = 33;
- ```
- .
- <div></div>
- ``` c
- int x = 33;
- ```
- .
- A comment:
- .
- <!-- Foo
- bar
- baz -->
- .
- <!-- Foo
- bar
- baz -->
- .
- A processing instruction:
- .
- <?php
- echo '>';
- ?>
- .
- <?php
- echo '>';
- ?>
- .
- CDATA:
- .
- <![CDATA[
- function matchwo(a,b)
- {
- if (a < b && a < 0) then
- {
- return 1;
- }
- else
- {
- return 0;
- }
- }
- ]]>
- .
- <![CDATA[
- function matchwo(a,b)
- {
- if (a < b && a < 0) then
- {
- return 1;
- }
- else
- {
- return 0;
- }
- }
- ]]>
- .
- The opening tag can be indented 1-3 spaces, but not 4:
- .
- <!-- foo -->
- <!-- foo -->
- .
- <!-- foo -->
- <pre><code><!-- foo -->
- </code></pre>
- .
- An HTML block can interrupt a paragraph, and need not be preceded
- by a blank line.
- .
- Foo
- <div>
- bar
- </div>
- .
- <p>Foo</p>
- <div>
- bar
- </div>
- .
- However, a following blank line is always needed, except at the end of
- a document:
- .
- <div>
- bar
- </div>
- *foo*
- .
- <div>
- bar
- </div>
- *foo*
- .
- An incomplete HTML block tag may also start an HTML block:
- .
- <div class
- foo
- .
- <div class
- foo
- .
- This rule differs from John Gruber's original Markdown syntax
- specification, which says:
- > The only restrictions are that block-level HTML elements —
- > e.g. `<div>`, `<table>`, `<pre>`, `<p>`, etc. — must be separated from
- > surrounding content by blank lines, and the start and end tags of the
- > block should not be indented with tabs or spaces.
- In some ways Gruber's rule is more restrictive than the one given
- here:
- - It requires that an HTML block be preceded by a blank line.
- - It does not allow the start tag to be indented.
- - It requires a matching end tag, which it also does not allow to
- be indented.
- Indeed, most Markdown implementations, including some of Gruber's
- own perl implementations, do not impose these restrictions.
- There is one respect, however, in which Gruber's rule is more liberal
- than the one given here, since it allows blank lines to occur inside
- an HTML block. There are two reasons for disallowing them here.
- First, it removes the need to parse balanced tags, which is
- expensive and can require backtracking from the end of the document
- if no matching end tag is found. Second, it provides a very simple
- and flexible way of including Markdown content inside HTML tags:
- simply separate the Markdown from the HTML using blank lines:
- .
- <div>
- *Emphasized* text.
- </div>
- .
- <div>
- <p><em>Emphasized</em> text.</p>
- </div>
- .
- Compare:
- .
- <div>
- *Emphasized* text.
- </div>
- .
- <div>
- *Emphasized* text.
- </div>
- .
- Some Markdown implementations have adopted a convention of
- interpreting content inside tags as text if the open tag has
- the attribute `markdown=1`. The rule given above seems a simpler and
- more elegant way of achieving the same expressive power, which is also
- much simpler to parse.
- The main potential drawback is that one can no longer paste HTML
- blocks into Markdown documents with 100% reliability. However,
- *in most cases* this will work fine, because the blank lines in
- HTML are usually followed by HTML block tags. For example:
- .
- <table>
- <tr>
- <td>
- Hi
- </td>
- </tr>
- </table>
- .
- <table>
- <tr>
- <td>
- Hi
- </td>
- </tr>
- </table>
- .
- Moreover, blank lines are usually not necessary and can be
- deleted. The exception is inside `<pre>` tags; here, one can
- replace the blank lines with ` ` entities.
- So there is no important loss of expressive power with the new rule.
- ## Link reference definitions
- A [link reference definition](@link-reference-definition)
- consists of a [link label], indented up to three spaces, followed
- by a colon (`:`), optional [whitespace] (including up to one
- [line ending]), a [link destination],
- optional [whitespace] (including up to one
- [line ending]), and an optional [link
- title], which if it is present must be separated
- from the [link destination] by [whitespace].
- No further [non-space character]s may occur on the line.
- A [link reference definition]
- does not correspond to a structural element of a document. Instead, it
- defines a label which can be used in [reference link]s
- and reference-style [images] elsewhere in the document. [Link
- reference definitions] can come either before or after the links that use
- them.
- .
- [foo]: /url "title"
- [foo]
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- .
- [foo]:
- /url
- 'the title'
- [foo]
- .
- <p><a href="/url" title="the title">foo</a></p>
- .
- .
- [Foo*bar\]]:my_(url) 'title (with parens)'
- [Foo*bar\]]
- .
- <p><a href="my_(url)" title="title (with parens)">Foo*bar]</a></p>
- .
- .
- [Foo bar]:
- <my url>
- 'title'
- [Foo bar]
- .
- <p><a href="my%20url" title="title">Foo bar</a></p>
- .
- The title may extend over multiple lines:
- .
- [foo]: /url '
- title
- line1
- line2
- '
- [foo]
- .
- <p><a href="/url" title="
- title
- line1
- line2
- ">foo</a></p>
- .
- However, it may not contain a [blank line]:
- .
- [foo]: /url 'title
- with blank line'
- [foo]
- .
- <p>[foo]: /url 'title</p>
- <p>with blank line'</p>
- <p>[foo]</p>
- .
- The title may be omitted:
- .
- [foo]:
- /url
- [foo]
- .
- <p><a href="/url">foo</a></p>
- .
- The link destination may not be omitted:
- .
- [foo]:
- [foo]
- .
- <p>[foo]:</p>
- <p>[foo]</p>
- .
- A link can come before its corresponding definition:
- .
- [foo]
- [foo]: url
- .
- <p><a href="url">foo</a></p>
- .
- If there are several matching definitions, the first one takes
- precedence:
- .
- [foo]
- [foo]: first
- [foo]: second
- .
- <p><a href="first">foo</a></p>
- .
- As noted in the section on [Links], matching of labels is
- case-insensitive (see [matches]).
- .
- [FOO]: /url
- [Foo]
- .
- <p><a href="/url">Foo</a></p>
- .
- .
- [ΑΓΩ]: /φου
- [αγω]
- .
- <p><a href="/%CF%86%CE%BF%CF%85">αγω</a></p>
- .
- Here is a link reference definition with no corresponding link.
- It contributes nothing to the document.
- .
- [foo]: /url
- .
- .
- Here is another one:
- .
- [
- foo
- ]: /url
- bar
- .
- <p>bar</p>
- .
- This is not a link reference definition, because there are
- [non-space character]s after the title:
- .
- [foo]: /url "title" ok
- .
- <p>[foo]: /url "title" ok</p>
- .
- This is not a link reference definition, because it is indented
- four spaces:
- .
- [foo]: /url "title"
- [foo]
- .
- <pre><code>[foo]: /url "title"
- </code></pre>
- <p>[foo]</p>
- .
- This is not a link reference definition, because it occurs inside
- a code block:
- .
- ```
- [foo]: /url
- ```
- [foo]
- .
- <pre><code>[foo]: /url
- </code></pre>
- <p>[foo]</p>
- .
- A [link reference definition] cannot interrupt a paragraph.
- .
- Foo
- [bar]: /baz
- [bar]
- .
- <p>Foo
- [bar]: /baz</p>
- <p>[bar]</p>
- .
- However, it can directly follow other block elements, such as headers
- and horizontal rules, and it need not be followed by a blank line.
- .
- # [Foo]
- [foo]: /url
- > bar
- .
- <h1><a href="/url">Foo</a></h1>
- <blockquote>
- <p>bar</p>
- </blockquote>
- .
- Several [link reference definition]s
- can occur one after another, without intervening blank lines.
- .
- [foo]: /foo-url "foo"
- [bar]: /bar-url
- "bar"
- [baz]: /baz-url
- [foo],
- [bar],
- [baz]
- .
- <p><a href="/foo-url" title="foo">foo</a>,
- <a href="/bar-url" title="bar">bar</a>,
- <a href="/baz-url">baz</a></p>
- .
- [Link reference definition]s can occur
- inside block containers, like lists and block quotations. They
- affect the entire document, not just the container in which they
- are defined:
- .
- [foo]
- > [foo]: /url
- .
- <p><a href="/url">foo</a></p>
- <blockquote>
- </blockquote>
- .
- ## Paragraphs
- A sequence of non-blank lines that cannot be interpreted as other
- kinds of blocks forms a [paragraph](@paragraph).
- The contents of the paragraph are the result of parsing the
- paragraph's raw content as inlines. The paragraph's raw content
- is formed by concatenating the lines and removing initial and final
- [whitespace].
- A simple example with two paragraphs:
- .
- aaa
- bbb
- .
- <p>aaa</p>
- <p>bbb</p>
- .
- Paragraphs can contain multiple lines, but no blank lines:
- .
- aaa
- bbb
- ccc
- ddd
- .
- <p>aaa
- bbb</p>
- <p>ccc
- ddd</p>
- .
- Multiple blank lines between paragraph have no effect:
- .
- aaa
- bbb
- .
- <p>aaa</p>
- <p>bbb</p>
- .
- Leading spaces are skipped:
- .
- aaa
- bbb
- .
- <p>aaa
- bbb</p>
- .
- Lines after the first may be indented any amount, since indented
- code blocks cannot interrupt paragraphs.
- .
- aaa
- bbb
- ccc
- .
- <p>aaa
- bbb
- ccc</p>
- .
- However, the first line may be indented at most three spaces,
- or an indented code block will be triggered:
- .
- aaa
- bbb
- .
- <p>aaa
- bbb</p>
- .
- .
- aaa
- bbb
- .
- <pre><code>aaa
- </code></pre>
- <p>bbb</p>
- .
- Final spaces are stripped before inline parsing, so a paragraph
- that ends with two or more spaces will not end with a [hard line
- break]:
- .
- aaa
- bbb
- .
- <p>aaa<br />
- bbb</p>
- .
- ## Blank lines
- [Blank line]s between block-level elements are ignored,
- except for the role they play in determining whether a [list]
- is [tight] or [loose].
- Blank lines at the beginning and end of the document are also ignored.
- .
-
- aaa
-
- # aaa
-
- .
- <p>aaa</p>
- <h1>aaa</h1>
- .
- # Container blocks
- A [container block] is a block that has other
- blocks as its contents. There are two basic kinds of container blocks:
- [block quotes] and [list items].
- [Lists] are meta-containers for [list items].
- We define the syntax for container blocks recursively. The general
- form of the definition is:
- > If X is a sequence of blocks, then the result of
- > transforming X in such-and-such a way is a container of type Y
- > with these blocks as its content.
- So, we explain what counts as a block quote or list item by explaining
- how these can be *generated* from their contents. This should suffice
- to define the syntax, although it does not give a recipe for *parsing*
- these constructions. (A recipe is provided below in the section entitled
- [A parsing strategy](#appendix-a-a-parsing-strategy).)
- ## Block quotes
- A [block quote marker](@block-quote-marker)
- consists of 0-3 spaces of initial indent, plus (a) the character `>` together
- with a following space, or (b) a single character `>` not followed by a space.
- The following rules define [block quotes]:
- 1. **Basic case.** If a string of lines *Ls* constitute a sequence
- of blocks *Bs*, then the result of prepending a [block quote
- marker] to the beginning of each line in *Ls*
- is a [block quote](#block-quotes) containing *Bs*.
- 2. **Laziness.** If a string of lines *Ls* constitute a [block
- quote](#block-quotes) with contents *Bs*, then the result of deleting
- the initial [block quote marker] from one or
- more lines in which the next [non-space character] after the [block
- quote marker] is [paragraph continuation
- text] is a block quote with *Bs* as its content.
- [Paragraph continuation text](@paragraph-continuation-text) is text
- that will be parsed as part of the content of a paragraph, but does
- not occur at the beginning of the paragraph.
- 3. **Consecutiveness.** A document cannot contain two [block
- quotes] in a row unless there is a [blank line] between them.
- Nothing else counts as a [block quote](#block-quotes).
- Here is a simple example:
- .
- > # Foo
- > bar
- > baz
- .
- <blockquote>
- <h1>Foo</h1>
- <p>bar
- baz</p>
- </blockquote>
- .
- The spaces after the `>` characters can be omitted:
- .
- ># Foo
- >bar
- > baz
- .
- <blockquote>
- <h1>Foo</h1>
- <p>bar
- baz</p>
- </blockquote>
- .
- The `>` characters can be indented 1-3 spaces:
- .
- > # Foo
- > bar
- > baz
- .
- <blockquote>
- <h1>Foo</h1>
- <p>bar
- baz</p>
- </blockquote>
- .
- Four spaces gives us a code block:
- .
- > # Foo
- > bar
- > baz
- .
- <pre><code>> # Foo
- > bar
- > baz
- </code></pre>
- .
- The Laziness clause allows us to omit the `>` before a
- paragraph continuation line:
- .
- > # Foo
- > bar
- baz
- .
- <blockquote>
- <h1>Foo</h1>
- <p>bar
- baz</p>
- </blockquote>
- .
- A block quote can contain some lazy and some non-lazy
- continuation lines:
- .
- > bar
- baz
- > foo
- .
- <blockquote>
- <p>bar
- baz
- foo</p>
- </blockquote>
- .
- Laziness only applies to lines that would have been continuations of
- paragraphs had they been prepended with `>`. For example, the
- `>` cannot be omitted in the second line of
- ``` markdown
- > foo
- > ---
- ```
- without changing the meaning:
- .
- > foo
- ---
- .
- <blockquote>
- <p>foo</p>
- </blockquote>
- <hr />
- .
- Similarly, if we omit the `>` in the second line of
- ``` markdown
- > - foo
- > - bar
- ```
- then the block quote ends after the first line:
- .
- > - foo
- - bar
- .
- <blockquote>
- <ul>
- <li>foo</li>
- </ul>
- </blockquote>
- <ul>
- <li>bar</li>
- </ul>
- .
- For the same reason, we can't omit the `>` in front of
- subsequent lines of an indented or fenced code block:
- .
- > foo
- bar
- .
- <blockquote>
- <pre><code>foo
- </code></pre>
- </blockquote>
- <pre><code>bar
- </code></pre>
- .
- .
- > ```
- foo
- ```
- .
- <blockquote>
- <pre><code></code></pre>
- </blockquote>
- <p>foo</p>
- <pre><code></code></pre>
- .
- A block quote can be empty:
- .
- >
- .
- <blockquote>
- </blockquote>
- .
- .
- >
- >
- >
- .
- <blockquote>
- </blockquote>
- .
- A block quote can have initial or final blank lines:
- .
- >
- > foo
- >
- .
- <blockquote>
- <p>foo</p>
- </blockquote>
- .
- A blank line always separates block quotes:
- .
- > foo
- > bar
- .
- <blockquote>
- <p>foo</p>
- </blockquote>
- <blockquote>
- <p>bar</p>
- </blockquote>
- .
- (Most current Markdown implementations, including John Gruber's
- original `Markdown.pl`, will parse this example as a single block quote
- with two paragraphs. But it seems better to allow the author to decide
- whether two block quotes or one are wanted.)
- Consecutiveness means that if we put these block quotes together,
- we get a single block quote:
- .
- > foo
- > bar
- .
- <blockquote>
- <p>foo
- bar</p>
- </blockquote>
- .
- To get a block quote with two paragraphs, use:
- .
- > foo
- >
- > bar
- .
- <blockquote>
- <p>foo</p>
- <p>bar</p>
- </blockquote>
- .
- Block quotes can interrupt paragraphs:
- .
- foo
- > bar
- .
- <p>foo</p>
- <blockquote>
- <p>bar</p>
- </blockquote>
- .
- In general, blank lines are not needed before or after block
- quotes:
- .
- > aaa
- ***
- > bbb
- .
- <blockquote>
- <p>aaa</p>
- </blockquote>
- <hr />
- <blockquote>
- <p>bbb</p>
- </blockquote>
- .
- However, because of laziness, a blank line is needed between
- a block quote and a following paragraph:
- .
- > bar
- baz
- .
- <blockquote>
- <p>bar
- baz</p>
- </blockquote>
- .
- .
- > bar
- baz
- .
- <blockquote>
- <p>bar</p>
- </blockquote>
- <p>baz</p>
- .
- .
- > bar
- >
- baz
- .
- <blockquote>
- <p>bar</p>
- </blockquote>
- <p>baz</p>
- .
- It is a consequence of the Laziness rule that any number
- of initial `>`s may be omitted on a continuation line of a
- nested block quote:
- .
- > > > foo
- bar
- .
- <blockquote>
- <blockquote>
- <blockquote>
- <p>foo
- bar</p>
- </blockquote>
- </blockquote>
- </blockquote>
- .
- .
- >>> foo
- > bar
- >>baz
- .
- <blockquote>
- <blockquote>
- <blockquote>
- <p>foo
- bar
- baz</p>
- </blockquote>
- </blockquote>
- </blockquote>
- .
- When including an indented code block in a block quote,
- remember that the [block quote marker] includes
- both the `>` and a following space. So *five spaces* are needed after
- the `>`:
- .
- > code
- > not code
- .
- <blockquote>
- <pre><code>code
- </code></pre>
- </blockquote>
- <blockquote>
- <p>not code</p>
- </blockquote>
- .
- ## List items
- A [list marker](@list-marker) is a
- [bullet list marker] or an [ordered list marker].
- A [bullet list marker](@bullet-list-marker)
- is a `-`, `+`, or `*` character.
- An [ordered list marker](@ordered-list-marker)
- is a sequence of one of more digits (`0-9`), followed by either a
- `.` character or a `)` character.
- The following rules define [list items]:
- 1. **Basic case.** If a sequence of lines *Ls* constitute a sequence of
- blocks *Bs* starting with a [non-space character] and not separated
- from each other by more than one blank line, and *M* is a list
- marker of width *W* followed by 0 < *N* < 5 spaces, then the result
- of prepending *M* and the following spaces to the first line of
- *Ls*, and indenting subsequent lines of *Ls* by *W + N* spaces, is a
- list item with *Bs* as its contents. The type of the list item
- (bullet or ordered) is determined by the type of its list marker.
- If the list item is ordered, then it is also assigned a start
- number, based on the ordered list marker.
- For example, let *Ls* be the lines
- .
- A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- .
- And let *M* be the marker `1.`, and *N* = 2. Then rule #1 says
- that the following is an ordered list item with start number 1,
- and the same contents as *Ls*:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <ol>
- <li>
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- </li>
- </ol>
- .
- The most important thing to notice is that the position of
- the text after the list marker determines how much indentation
- is needed in subsequent blocks in the list item. If the list
- marker takes up two spaces, and there are three spaces between
- the list marker and the next [non-space character], then blocks
- must be indented five spaces in order to fall under the list
- item.
- Here are some examples showing how far content must be indented to be
- put under the list item:
- .
- - one
- two
- .
- <ul>
- <li>one</li>
- </ul>
- <p>two</p>
- .
- .
- - one
- two
- .
- <ul>
- <li>
- <p>one</p>
- <p>two</p>
- </li>
- </ul>
- .
- .
- - one
- two
- .
- <ul>
- <li>one</li>
- </ul>
- <pre><code> two
- </code></pre>
- .
- .
- - one
- two
- .
- <ul>
- <li>
- <p>one</p>
- <p>two</p>
- </li>
- </ul>
- .
- It is tempting to think of this in terms of columns: the continuation
- blocks must be indented at least to the column of the first
- [non-space character] after the list marker. However, that is not quite right.
- The spaces after the list marker determine how much relative indentation
- is needed. Which column this indentation reaches will depend on
- how the list item is embedded in other constructions, as shown by
- this example:
- .
- > > 1. one
- >>
- >> two
- .
- <blockquote>
- <blockquote>
- <ol>
- <li>
- <p>one</p>
- <p>two</p>
- </li>
- </ol>
- </blockquote>
- </blockquote>
- .
- Here `two` occurs in the same column as the list marker `1.`,
- but is actually contained in the list item, because there is
- sufficient indentation after the last containing blockquote marker.
- The converse is also possible. In the following example, the word `two`
- occurs far to the right of the initial text of the list item, `one`, but
- it is not considered part of the list item, because it is not indented
- far enough past the blockquote marker:
- .
- >>- one
- >>
- > > two
- .
- <blockquote>
- <blockquote>
- <ul>
- <li>one</li>
- </ul>
- <p>two</p>
- </blockquote>
- </blockquote>
- .
- Note that at least one space is needed between the list marker and
- any following content, so these are not list items:
- .
- -one
- 2.two
- .
- <p>-one</p>
- <p>2.two</p>
- .
- A list item may not contain blocks that are separated by more than
- one blank line. Thus, two blank lines will end a list, unless the
- two blanks are contained in a [fenced code block].
- .
- - foo
- bar
- - foo
- bar
- - ```
- foo
- bar
- ```
- - baz
- + ```
- foo
- bar
- ```
- .
- <ul>
- <li>
- <p>foo</p>
- <p>bar</p>
- </li>
- <li>
- <p>foo</p>
- </li>
- </ul>
- <p>bar</p>
- <ul>
- <li>
- <pre><code>foo
- bar
- </code></pre>
- </li>
- <li>
- <p>baz</p>
- <ul>
- <li>
- <pre><code>foo
- bar
- </code></pre>
- </li>
- </ul>
- </li>
- </ul>
- .
- A list item may contain any kind of block:
- .
- 1. foo
- ```
- bar
- ```
- baz
- > bam
- .
- <ol>
- <li>
- <p>foo</p>
- <pre><code>bar
- </code></pre>
- <p>baz</p>
- <blockquote>
- <p>bam</p>
- </blockquote>
- </li>
- </ol>
- .
- 2. **Item starting with indented code.** If a sequence of lines *Ls*
- constitute a sequence of blocks *Bs* starting with an indented code
- block and not separated from each other by more than one blank line,
- and *M* is a list marker of width *W* followed by
- one space, then the result of prepending *M* and the following
- space to the first line of *Ls*, and indenting subsequent lines of
- *Ls* by *W + 1* spaces, is a list item with *Bs* as its contents.
- If a line is empty, then it need not be indented. The type of the
- list item (bullet or ordered) is determined by the type of its list
- marker. If the list item is ordered, then it is also assigned a
- start number, based on the ordered list marker.
- An indented code block will have to be indented four spaces beyond
- the edge of the region where text will be included in the list item.
- In the following case that is 6 spaces:
- .
- - foo
- bar
- .
- <ul>
- <li>
- <p>foo</p>
- <pre><code>bar
- </code></pre>
- </li>
- </ul>
- .
- And in this case it is 11 spaces:
- .
- 10. foo
- bar
- .
- <ol start="10">
- <li>
- <p>foo</p>
- <pre><code>bar
- </code></pre>
- </li>
- </ol>
- .
- If the *first* block in the list item is an indented code block,
- then by rule #2, the contents must be indented *one* space after the
- list marker:
- .
- indented code
- paragraph
- more code
- .
- <pre><code>indented code
- </code></pre>
- <p>paragraph</p>
- <pre><code>more code
- </code></pre>
- .
- .
- 1. indented code
- paragraph
- more code
- .
- <ol>
- <li>
- <pre><code>indented code
- </code></pre>
- <p>paragraph</p>
- <pre><code>more code
- </code></pre>
- </li>
- </ol>
- .
- Note that an additional space indent is interpreted as space
- inside the code block:
- .
- 1. indented code
- paragraph
- more code
- .
- <ol>
- <li>
- <pre><code> indented code
- </code></pre>
- <p>paragraph</p>
- <pre><code>more code
- </code></pre>
- </li>
- </ol>
- .
- Note that rules #1 and #2 only apply to two cases: (a) cases
- in which the lines to be included in a list item begin with a
- [non-space character], and (b) cases in which
- they begin with an indented code
- block. In a case like the following, where the first block begins with
- a three-space indent, the rules do not allow us to form a list item by
- indenting the whole thing and prepending a list marker:
- .
- foo
- bar
- .
- <p>foo</p>
- <p>bar</p>
- .
- .
- - foo
- bar
- .
- <ul>
- <li>foo</li>
- </ul>
- <p>bar</p>
- .
- This is not a significant restriction, because when a block begins
- with 1-3 spaces indent, the indentation can always be removed without
- a change in interpretation, allowing rule #1 to be applied. So, in
- the above case:
- .
- - foo
- bar
- .
- <ul>
- <li>
- <p>foo</p>
- <p>bar</p>
- </li>
- </ul>
- .
- 3. **Item starting with a blank line.** If a sequence of lines *Ls*
- starting with a single [blank line] constitute a (possibly empty)
- sequence of blocks *Bs*, not separated from each other by more than
- one blank line, and *M* is a list marker of width *W*,
- then the result of prepending *M* to the first line of *Ls*, and
- indenting subsequent lines of *Ls* by *W + 1* spaces, is a list
- item with *Bs* as its contents.
- If a line is empty, then it need not be indented. The type of the
- list item (bullet or ordered) is determined by the type of its list
- marker. If the list item is ordered, then it is also assigned a
- start number, based on the ordered list marker.
- Here are some list items that start with a blank line but are not empty:
- .
- -
- foo
- -
- ```
- bar
- ```
- -
- baz
- .
- <ul>
- <li>foo</li>
- <li>
- <pre><code>bar
- </code></pre>
- </li>
- <li>
- <pre><code>baz
- </code></pre>
- </li>
- </ul>
- .
- Here is an empty bullet list item:
- .
- - foo
- -
- - bar
- .
- <ul>
- <li>foo</li>
- <li></li>
- <li>bar</li>
- </ul>
- .
- It does not matter whether there are spaces following the [list marker]:
- .
- - foo
- -
- - bar
- .
- <ul>
- <li>foo</li>
- <li></li>
- <li>bar</li>
- </ul>
- .
- Here is an empty ordered list item:
- .
- 1. foo
- 2.
- 3. bar
- .
- <ol>
- <li>foo</li>
- <li></li>
- <li>bar</li>
- </ol>
- .
- A list may start or end with an empty list item:
- .
- *
- .
- <ul>
- <li></li>
- </ul>
- .
- 4. **Indentation.** If a sequence of lines *Ls* constitutes a list item
- according to rule #1, #2, or #3, then the result of indenting each line
- of *Ls* by 1-3 spaces (the same for each line) also constitutes a
- list item with the same contents and attributes. If a line is
- empty, then it need not be indented.
- Indented one space:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <ol>
- <li>
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- </li>
- </ol>
- .
- Indented two spaces:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <ol>
- <li>
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- </li>
- </ol>
- .
- Indented three spaces:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <ol>
- <li>
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- </li>
- </ol>
- .
- Four spaces indent gives a code block:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <pre><code>1. A paragraph
- with two lines.
- indented code
- > A block quote.
- </code></pre>
- .
- 5. **Laziness.** If a string of lines *Ls* constitute a [list
- item](#list-items) with contents *Bs*, then the result of deleting
- some or all of the indentation from one or more lines in which the
- next [non-space character] after the indentation is
- [paragraph continuation text] is a
- list item with the same contents and attributes. The unindented
- lines are called
- [lazy continuation line](@lazy-continuation-line)s.
- Here is an example with [lazy continuation line]s:
- .
- 1. A paragraph
- with two lines.
- indented code
- > A block quote.
- .
- <ol>
- <li>
- <p>A paragraph
- with two lines.</p>
- <pre><code>indented code
- </code></pre>
- <blockquote>
- <p>A block quote.</p>
- </blockquote>
- </li>
- </ol>
- .
- Indentation can be partially deleted:
- .
- 1. A paragraph
- with two lines.
- .
- <ol>
- <li>A paragraph
- with two lines.</li>
- </ol>
- .
- These examples show how laziness can work in nested structures:
- .
- > 1. > Blockquote
- continued here.
- .
- <blockquote>
- <ol>
- <li>
- <blockquote>
- <p>Blockquote
- continued here.</p>
- </blockquote>
- </li>
- </ol>
- </blockquote>
- .
- .
- > 1. > Blockquote
- > continued here.
- .
- <blockquote>
- <ol>
- <li>
- <blockquote>
- <p>Blockquote
- continued here.</p>
- </blockquote>
- </li>
- </ol>
- </blockquote>
- .
- 6. **That's all.** Nothing that is not counted as a list item by rules
- #1--5 counts as a [list item](#list-items).
- The rules for sublists follow from the general rules above. A sublist
- must be indented the same number of spaces a paragraph would need to be
- in order to be included in the list item.
- So, in this case we need two spaces indent:
- .
- - foo
- - bar
- - baz
- .
- <ul>
- <li>foo
- <ul>
- <li>bar
- <ul>
- <li>baz</li>
- </ul>
- </li>
- </ul>
- </li>
- </ul>
- .
- One is not enough:
- .
- - foo
- - bar
- - baz
- .
- <ul>
- <li>foo</li>
- <li>bar</li>
- <li>baz</li>
- </ul>
- .
- Here we need four, because the list marker is wider:
- .
- 10) foo
- - bar
- .
- <ol start="10">
- <li>foo
- <ul>
- <li>bar</li>
- </ul>
- </li>
- </ol>
- .
- Three is not enough:
- .
- 10) foo
- - bar
- .
- <ol start="10">
- <li>foo</li>
- </ol>
- <ul>
- <li>bar</li>
- </ul>
- .
- A list may be the first block in a list item:
- .
- - - foo
- .
- <ul>
- <li>
- <ul>
- <li>foo</li>
- </ul>
- </li>
- </ul>
- .
- .
- 1. - 2. foo
- .
- <ol>
- <li>
- <ul>
- <li>
- <ol start="2">
- <li>foo</li>
- </ol>
- </li>
- </ul>
- </li>
- </ol>
- .
- A list item can contain a header:
- .
- - # Foo
- - Bar
- ---
- baz
- .
- <ul>
- <li>
- <h1>Foo</h1>
- </li>
- <li>
- <h2>Bar</h2>
- baz</li>
- </ul>
- .
- ### Motivation
- John Gruber's Markdown spec says the following about list items:
- 1. "List markers typically start at the left margin, but may be indented
- by up to three spaces. List markers must be followed by one or more
- spaces or a tab."
- 2. "To make lists look nice, you can wrap items with hanging indents....
- But if you don't want to, you don't have to."
- 3. "List items may consist of multiple paragraphs. Each subsequent
- paragraph in a list item must be indented by either 4 spaces or one
- tab."
- 4. "It looks nice if you indent every line of the subsequent paragraphs,
- but here again, Markdown will allow you to be lazy."
- 5. "To put a blockquote within a list item, the blockquote's `>`
- delimiters need to be indented."
- 6. "To put a code block within a list item, the code block needs to be
- indented twice — 8 spaces or two tabs."
- These rules specify that a paragraph under a list item must be indented
- four spaces (presumably, from the left margin, rather than the start of
- the list marker, but this is not said), and that code under a list item
- must be indented eight spaces instead of the usual four. They also say
- that a block quote must be indented, but not by how much; however, the
- example given has four spaces indentation. Although nothing is said
- about other kinds of block-level content, it is certainly reasonable to
- infer that *all* block elements under a list item, including other
- lists, must be indented four spaces. This principle has been called the
- *four-space rule*.
- The four-space rule is clear and principled, and if the reference
- implementation `Markdown.pl` had followed it, it probably would have
- become the standard. However, `Markdown.pl` allowed paragraphs and
- sublists to start with only two spaces indentation, at least on the
- outer level. Worse, its behavior was inconsistent: a sublist of an
- outer-level list needed two spaces indentation, but a sublist of this
- sublist needed three spaces. It is not surprising, then, that different
- implementations of Markdown have developed very different rules for
- determining what comes under a list item. (Pandoc and python-Markdown,
- for example, stuck with Gruber's syntax description and the four-space
- rule, while discount, redcarpet, marked, PHP Markdown, and others
- followed `Markdown.pl`'s behavior more closely.)
- Unfortunately, given the divergences between implementations, there
- is no way to give a spec for list items that will be guaranteed not
- to break any existing documents. However, the spec given here should
- correctly handle lists formatted with either the four-space rule or
- the more forgiving `Markdown.pl` behavior, provided they are laid out
- in a way that is natural for a human to read.
- The strategy here is to let the width and indentation of the list marker
- determine the indentation necessary for blocks to fall under the list
- item, rather than having a fixed and arbitrary number. The writer can
- think of the body of the list item as a unit which gets indented to the
- right enough to fit the list marker (and any indentation on the list
- marker). (The laziness rule, #5, then allows continuation lines to be
- unindented if needed.)
- This rule is superior, we claim, to any rule requiring a fixed level of
- indentation from the margin. The four-space rule is clear but
- unnatural. It is quite unintuitive that
- ``` markdown
- - foo
- bar
- - baz
- ```
- should be parsed as two lists with an intervening paragraph,
- ``` html
- <ul>
- <li>foo</li>
- </ul>
- <p>bar</p>
- <ul>
- <li>baz</li>
- </ul>
- ```
- as the four-space rule demands, rather than a single list,
- ``` html
- <ul>
- <li>
- <p>foo</p>
- <p>bar</p>
- <ul>
- <li>baz</li>
- </ul>
- </li>
- </ul>
- ```
- The choice of four spaces is arbitrary. It can be learned, but it is
- not likely to be guessed, and it trips up beginners regularly.
- Would it help to adopt a two-space rule? The problem is that such
- a rule, together with the rule allowing 1--3 spaces indentation of the
- initial list marker, allows text that is indented *less than* the
- original list marker to be included in the list item. For example,
- `Markdown.pl` parses
- ``` markdown
- - one
- two
- ```
- as a single list item, with `two` a continuation paragraph:
- ``` html
- <ul>
- <li>
- <p>one</p>
- <p>two</p>
- </li>
- </ul>
- ```
- and similarly
- ``` markdown
- > - one
- >
- > two
- ```
- as
- ``` html
- <blockquote>
- <ul>
- <li>
- <p>one</p>
- <p>two</p>
- </li>
- </ul>
- </blockquote>
- ```
- This is extremely unintuitive.
- Rather than requiring a fixed indent from the margin, we could require
- a fixed indent (say, two spaces, or even one space) from the list marker (which
- may itself be indented). This proposal would remove the last anomaly
- discussed. Unlike the spec presented above, it would count the following
- as a list item with a subparagraph, even though the paragraph `bar`
- is not indented as far as the first paragraph `foo`:
- ``` markdown
- 10. foo
- bar
- ```
- Arguably this text does read like a list item with `bar` as a subparagraph,
- which may count in favor of the proposal. However, on this proposal indented
- code would have to be indented six spaces after the list marker. And this
- would break a lot of existing Markdown, which has the pattern:
- ``` markdown
- 1. foo
- indented code
- ```
- where the code is indented eight spaces. The spec above, by contrast, will
- parse this text as expected, since the code block's indentation is measured
- from the beginning of `foo`.
- The one case that needs special treatment is a list item that *starts*
- with indented code. How much indentation is required in that case, since
- we don't have a "first paragraph" to measure from? Rule #2 simply stipulates
- that in such cases, we require one space indentation from the list marker
- (and then the normal four spaces for the indented code). This will match the
- four-space rule in cases where the list marker plus its initial indentation
- takes four spaces (a common case), but diverge in other cases.
- ## Lists
- A [list](@list) is a sequence of one or more
- list items [of the same type]. The list items
- may be separated by single [blank lines], but two
- blank lines end all containing lists.
- Two list items are [of the same type](@of-the-same-type)
- if they begin with a [list marker] of the same type.
- Two list markers are of the
- same type if (a) they are bullet list markers using the same character
- (`-`, `+`, or `*`) or (b) they are ordered list numbers with the same
- delimiter (either `.` or `)`).
- A list is an [ordered list](@ordered-list)
- if its constituent list items begin with
- [ordered list marker]s, and a
- [bullet list](@bullet-list) if its constituent list
- items begin with [bullet list marker]s.
- The [start number](@start-number)
- of an [ordered list] is determined by the list number of
- its initial list item. The numbers of subsequent list items are
- disregarded.
- A list is [loose](@loose) if any of its constituent
- list items are separated by blank lines, or if any of its constituent
- list items directly contain two block-level elements with a blank line
- between them. Otherwise a list is [tight](@tight).
- (The difference in HTML output is that paragraphs in a loose list are
- wrapped in `<p>` tags, while paragraphs in a tight list are not.)
- Changing the bullet or ordered list delimiter starts a new list:
- .
- - foo
- - bar
- + baz
- .
- <ul>
- <li>foo</li>
- <li>bar</li>
- </ul>
- <ul>
- <li>baz</li>
- </ul>
- .
- .
- 1. foo
- 2. bar
- 3) baz
- .
- <ol>
- <li>foo</li>
- <li>bar</li>
- </ol>
- <ol start="3">
- <li>baz</li>
- </ol>
- .
- In CommonMark, a list can interrupt a paragraph. That is,
- no blank line is needed to separate a paragraph from a following
- list:
- .
- Foo
- - bar
- - baz
- .
- <p>Foo</p>
- <ul>
- <li>bar</li>
- <li>baz</li>
- </ul>
- .
- `Markdown.pl` does not allow this, through fear of triggering a list
- via a numeral in a hard-wrapped line:
- .
- The number of windows in my house is
- 14. The number of doors is 6.
- .
- <p>The number of windows in my house is</p>
- <ol start="14">
- <li>The number of doors is 6.</li>
- </ol>
- .
- Oddly, `Markdown.pl` *does* allow a blockquote to interrupt a paragraph,
- even though the same considerations might apply. We think that the two
- cases should be treated the same. Here are two reasons for allowing
- lists to interrupt paragraphs:
- First, it is natural and not uncommon for people to start lists without
- blank lines:
- I need to buy
- - new shoes
- - a coat
- - a plane ticket
- Second, we are attracted to a
- > [principle of uniformity](@principle-of-uniformity):
- > if a chunk of text has a certain
- > meaning, it will continue to have the same meaning when put into a
- > container block (such as a list item or blockquote).
- (Indeed, the spec for [list items] and [block quotes] presupposes
- this principle.) This principle implies that if
- * I need to buy
- - new shoes
- - a coat
- - a plane ticket
- is a list item containing a paragraph followed by a nested sublist,
- as all Markdown implementations agree it is (though the paragraph
- may be rendered without `<p>` tags, since the list is "tight"),
- then
- I need to buy
- - new shoes
- - a coat
- - a plane ticket
- by itself should be a paragraph followed by a nested sublist.
- Our adherence to the [principle of uniformity]
- thus inclines us to think that there are two coherent packages:
- 1. Require blank lines before *all* lists and blockquotes,
- including lists that occur as sublists inside other list items.
- 2. Require blank lines in none of these places.
- [reStructuredText](http://docutils.sourceforge.net/rst.html) takes
- the first approach, for which there is much to be said. But the second
- seems more consistent with established practice with Markdown.
- There can be blank lines between items, but two blank lines end
- a list:
- .
- - foo
- - bar
- - baz
- .
- <ul>
- <li>
- <p>foo</p>
- </li>
- <li>
- <p>bar</p>
- </li>
- </ul>
- <ul>
- <li>baz</li>
- </ul>
- .
- As illustrated above in the section on [list items],
- two blank lines between blocks *within* a list item will also end a
- list:
- .
- - foo
- bar
- - baz
- .
- <ul>
- <li>foo</li>
- </ul>
- <p>bar</p>
- <ul>
- <li>baz</li>
- </ul>
- .
- Indeed, two blank lines will end *all* containing lists:
- .
- - foo
- - bar
- - baz
- bim
- .
- <ul>
- <li>foo
- <ul>
- <li>bar
- <ul>
- <li>baz</li>
- </ul>
- </li>
- </ul>
- </li>
- </ul>
- <pre><code> bim
- </code></pre>
- .
- Thus, two blank lines can be used to separate consecutive lists of
- the same type, or to separate a list from an indented code block
- that would otherwise be parsed as a subparagraph of the final list
- item:
- .
- - foo
- - bar
- - baz
- - bim
- .
- <ul>
- <li>foo</li>
- <li>bar</li>
- </ul>
- <ul>
- <li>baz</li>
- <li>bim</li>
- </ul>
- .
- .
- - foo
- notcode
- - foo
- code
- .
- <ul>
- <li>
- <p>foo</p>
- <p>notcode</p>
- </li>
- <li>
- <p>foo</p>
- </li>
- </ul>
- <pre><code>code
- </code></pre>
- .
- List items need not be indented to the same level. The following
- list items will be treated as items at the same list level,
- since none is indented enough to belong to the previous list
- item:
- .
- - a
- - b
- - c
- - d
- - e
- - f
- - g
- - h
- - i
- .
- <ul>
- <li>a</li>
- <li>b</li>
- <li>c</li>
- <li>d</li>
- <li>e</li>
- <li>f</li>
- <li>g</li>
- <li>h</li>
- <li>i</li>
- </ul>
- .
- .
- 1. a
- 2. b
- 3. c
- .
- <ol>
- <li>
- <p>a</p>
- </li>
- <li>
- <p>b</p>
- </li>
- <li>
- <p>c</p>
- </li>
- </ol>
- .
- This is a loose list, because there is a blank line between
- two of the list items:
- .
- - a
- - b
- - c
- .
- <ul>
- <li>
- <p>a</p>
- </li>
- <li>
- <p>b</p>
- </li>
- <li>
- <p>c</p>
- </li>
- </ul>
- .
- So is this, with a empty second item:
- .
- * a
- *
- * c
- .
- <ul>
- <li>
- <p>a</p>
- </li>
- <li></li>
- <li>
- <p>c</p>
- </li>
- </ul>
- .
- These are loose lists, even though there is no space between the items,
- because one of the items directly contains two block-level elements
- with a blank line between them:
- .
- - a
- - b
- c
- - d
- .
- <ul>
- <li>
- <p>a</p>
- </li>
- <li>
- <p>b</p>
- <p>c</p>
- </li>
- <li>
- <p>d</p>
- </li>
- </ul>
- .
- .
- - a
- - b
- [ref]: /url
- - d
- .
- <ul>
- <li>
- <p>a</p>
- </li>
- <li>
- <p>b</p>
- </li>
- <li>
- <p>d</p>
- </li>
- </ul>
- .
- This is a tight list, because the blank lines are in a code block:
- .
- - a
- - ```
- b
- ```
- - c
- .
- <ul>
- <li>a</li>
- <li>
- <pre><code>b
- </code></pre>
- </li>
- <li>c</li>
- </ul>
- .
- This is a tight list, because the blank line is between two
- paragraphs of a sublist. So the sublist is loose while
- the outer list is tight:
- .
- - a
- - b
- c
- - d
- .
- <ul>
- <li>a
- <ul>
- <li>
- <p>b</p>
- <p>c</p>
- </li>
- </ul>
- </li>
- <li>d</li>
- </ul>
- .
- This is a tight list, because the blank line is inside the
- block quote:
- .
- * a
- > b
- >
- * c
- .
- <ul>
- <li>a
- <blockquote>
- <p>b</p>
- </blockquote>
- </li>
- <li>c</li>
- </ul>
- .
- This list is tight, because the consecutive block elements
- are not separated by blank lines:
- .
- - a
- > b
- ```
- c
- ```
- - d
- .
- <ul>
- <li>a
- <blockquote>
- <p>b</p>
- </blockquote>
- <pre><code>c
- </code></pre>
- </li>
- <li>d</li>
- </ul>
- .
- A single-paragraph list is tight:
- .
- - a
- .
- <ul>
- <li>a</li>
- </ul>
- .
- .
- - a
- - b
- .
- <ul>
- <li>a
- <ul>
- <li>b</li>
- </ul>
- </li>
- </ul>
- .
- This list is loose, because of the blank line between the
- two block elements in the list item:
- .
- 1. ```
- foo
- ```
- bar
- .
- <ol>
- <li>
- <pre><code>foo
- </code></pre>
- <p>bar</p>
- </li>
- </ol>
- .
- Here the outer list is loose, the inner list tight:
- .
- * foo
- * bar
- baz
- .
- <ul>
- <li>
- <p>foo</p>
- <ul>
- <li>bar</li>
- </ul>
- <p>baz</p>
- </li>
- </ul>
- .
- .
- - a
- - b
- - c
- - d
- - e
- - f
- .
- <ul>
- <li>
- <p>a</p>
- <ul>
- <li>b</li>
- <li>c</li>
- </ul>
- </li>
- <li>
- <p>d</p>
- <ul>
- <li>e</li>
- <li>f</li>
- </ul>
- </li>
- </ul>
- .
- # Inlines
- Inlines are parsed sequentially from the beginning of the character
- stream to the end (left to right, in left-to-right languages).
- Thus, for example, in
- .
- `hi`lo`
- .
- <p><code>hi</code>lo`</p>
- .
- `hi` is parsed as code, leaving the backtick at the end as a literal
- backtick.
- ## Backslash escapes
- Any ASCII punctuation character may be backslash-escaped:
- .
- \!\"\#\$\%\&\'\(\)\*\+\,\-\.\/\:\;\<\=\>\?\@\[\\\]\^\_\`\{\|\}\~
- .
- <p>!"#$%&'()*+,-./:;<=>?@[\]^_`{|}~</p>
- .
- Backslashes before other characters are treated as literal
- backslashes:
- .
- \→\A\a\ \3\φ\«
- .
- <p>\ \A\a\ \3\φ\«</p>
- .
- Escaped characters are treated as regular characters and do
- not have their usual Markdown meanings:
- .
- \*not emphasized*
- \<br/> not a tag
- \[not a link](/foo)
- \`not code`
- 1\. not a list
- \* not a list
- \# not a header
- \[foo]: /url "not a reference"
- .
- <p>*not emphasized*
- <br/> not a tag
- [not a link](/foo)
- `not code`
- 1. not a list
- * not a list
- # not a header
- [foo]: /url "not a reference"</p>
- .
- If a backslash is itself escaped, the following character is not:
- .
- \\*emphasis*
- .
- <p>\<em>emphasis</em></p>
- .
- A backslash at the end of the line is a [hard line break]:
- .
- foo\
- bar
- .
- <p>foo<br />
- bar</p>
- .
- Backslash escapes do not work in code blocks, code spans, autolinks, or
- raw HTML:
- .
- `` \[\` ``
- .
- <p><code>\[\`</code></p>
- .
- .
- \[\]
- .
- <pre><code>\[\]
- </code></pre>
- .
- .
- ~~~
- \[\]
- ~~~
- .
- <pre><code>\[\]
- </code></pre>
- .
- .
- <http://example.com?find=\*>
- .
- <p><a href="http://example.com?find=%5C*">http://example.com?find=\*</a></p>
- .
- .
- <a href="/bar\/)">
- .
- <p><a href="/bar\/)"></p>
- .
- But they work in all other contexts, including URLs and link titles,
- link references, and [info string]s in [fenced code block]s:
- .
- [foo](/bar\* "ti\*tle")
- .
- <p><a href="/bar*" title="ti*tle">foo</a></p>
- .
- .
- [foo]
- [foo]: /bar\* "ti\*tle"
- .
- <p><a href="/bar*" title="ti*tle">foo</a></p>
- .
- .
- ``` foo\+bar
- foo
- ```
- .
- <pre><code class="language-foo+bar">foo
- </code></pre>
- .
- ## Entities
- With the goal of making this standard as HTML-agnostic as possible, all
- valid HTML entities (except in code blocks and code spans)
- are recognized as such and converted into unicode characters before
- they are stored in the AST. This means that renderers to formats other
- than HTML need not be HTML-entity aware. HTML renderers may either escape
- unicode characters as entities or leave them as they are. (However,
- `"`, `&`, `<`, and `>` must always be rendered as entities.)
- [Named entities](@name-entities) consist of `&`
- + any of the valid HTML5 entity names + `;`. The
- [following document](https://html.spec.whatwg.org/multipage/entities.json)
- is used as an authoritative source of the valid entity names and their
- corresponding codepoints.
- .
- & © Æ Ď ¾ ℋ ⅆ ∲
- .
- <p> & © Æ Ď ¾ ℋ ⅆ ∲</p>
- .
- [Decimal entities](@decimal-entities)
- consist of `&#` + a string of 1--8 arabic digits + `;`. Again, these
- entities need to be recognised and transformed into their corresponding
- unicode codepoints. Invalid unicode codepoints will be replaced by
- the "unknown codepoint" character (`U+FFFD`). For security reasons,
- the codepoint `U+0000` will also be replaced by `U+FFFD`.
- .
- # Ӓ Ϡ � �
- .
- <p># Ӓ Ϡ � �</p>
- .
- [Hexadecimal entities](@hexadecimal-entities)
- consist of `&#` + either `X` or `x` + a string of 1-8 hexadecimal digits
- + `;`. They will also be parsed and turned into the corresponding
- unicode codepoints in the AST.
- .
- " ആ ಫ
- .
- <p>" ആ ಫ</p>
- .
- Here are some nonentities:
- .
-   &x; &#; &#x; &ThisIsWayTooLongToBeAnEntityIsntIt; &hi?;
- .
- <p>&nbsp &x; &#; &#x; &ThisIsWayTooLongToBeAnEntityIsntIt; &hi?;</p>
- .
- Although HTML5 does accept some entities without a trailing semicolon
- (such as `©`), these are not recognized as entities here, because it
- makes the grammar too ambiguous:
- .
- ©
- .
- <p>&copy</p>
- .
- Strings that are not on the list of HTML5 named entities are not
- recognized as entities either:
- .
- &MadeUpEntity;
- .
- <p>&MadeUpEntity;</p>
- .
- Entities are recognized in any context besides code spans or
- code blocks, including raw HTML, URLs, [link title]s, and
- [fenced code block] [info string]s:
- .
- <a href="öö.html">
- .
- <p><a href="öö.html"></p>
- .
- .
- [foo](/föö "föö")
- .
- <p><a href="/f%C3%B6%C3%B6" title="föö">foo</a></p>
- .
- .
- [foo]
- [foo]: /föö "föö"
- .
- <p><a href="/f%C3%B6%C3%B6" title="föö">foo</a></p>
- .
- .
- ``` föö
- foo
- ```
- .
- <pre><code class="language-föö">foo
- </code></pre>
- .
- Entities are treated as literal text in code spans and code blocks:
- .
- `föö`
- .
- <p><code>f&ouml;&ouml;</code></p>
- .
- .
- föfö
- .
- <pre><code>f&ouml;f&ouml;
- </code></pre>
- .
- ## Code spans
- A [backtick string](@backtick-string)
- is a string of one or more backtick characters (`` ` ``) that is neither
- preceded nor followed by a backtick.
- A [code span](@code-span) begins with a backtick string and ends with
- a backtick string of equal length. The contents of the code span are
- the characters between the two backtick strings, with leading and
- trailing spaces and [line ending]s removed, and
- [whitespace] collapsed to single spaces.
- This is a simple code span:
- .
- `foo`
- .
- <p><code>foo</code></p>
- .
- Here two backticks are used, because the code contains a backtick.
- This example also illustrates stripping of leading and trailing spaces:
- .
- `` foo ` bar ``
- .
- <p><code>foo ` bar</code></p>
- .
- This example shows the motivation for stripping leading and trailing
- spaces:
- .
- ` `` `
- .
- <p><code>``</code></p>
- .
- [Line ending]s are treated like spaces:
- .
- ``
- foo
- ``
- .
- <p><code>foo</code></p>
- .
- Interior spaces and [line ending]s are collapsed into
- single spaces, just as they would be by a browser:
- .
- `foo bar
- baz`
- .
- <p><code>foo bar baz</code></p>
- .
- Q: Why not just leave the spaces, since browsers will collapse them
- anyway? A: Because we might be targeting a non-HTML format, and we
- shouldn't rely on HTML-specific rendering assumptions.
- (Existing implementations differ in their treatment of internal
- spaces and [line ending]s. Some, including `Markdown.pl` and
- `showdown`, convert an internal [line ending] into a
- `<br />` tag. But this makes things difficult for those who like to
- hard-wrap their paragraphs, since a line break in the midst of a code
- span will cause an unintended line break in the output. Others just
- leave internal spaces as they are, which is fine if only HTML is being
- targeted.)
- .
- `foo `` bar`
- .
- <p><code>foo `` bar</code></p>
- .
- Note that backslash escapes do not work in code spans. All backslashes
- are treated literally:
- .
- `foo\`bar`
- .
- <p><code>foo\</code>bar`</p>
- .
- Backslash escapes are never needed, because one can always choose a
- string of *n* backtick characters as delimiters, where the code does
- not contain any strings of exactly *n* backtick characters.
- Code span backticks have higher precedence than any other inline
- constructs except HTML tags and autolinks. Thus, for example, this is
- not parsed as emphasized text, since the second `*` is part of a code
- span:
- .
- *foo`*`
- .
- <p>*foo<code>*</code></p>
- .
- And this is not parsed as a link:
- .
- [not a `link](/foo`)
- .
- <p>[not a <code>link](/foo</code>)</p>
- .
- Code spans, HTML tags, and autolinks have the same precedence.
- Thus, this is code:
- .
- `<a href="`">`
- .
- <p><code><a href="</code>">`</p>
- .
- But this is an HTML tag:
- .
- <a href="`">`
- .
- <p><a href="`">`</p>
- .
- And this is code:
- .
- `<http://foo.bar.`baz>`
- .
- <p><code><http://foo.bar.</code>baz>`</p>
- .
- But this is an autolink:
- .
- <http://foo.bar.`baz>`
- .
- <p><a href="http://foo.bar.%60baz">http://foo.bar.`baz</a>`</p>
- .
- When a backtick string is not closed by a matching backtick string,
- we just have literal backticks:
- .
- ```foo``
- .
- <p>```foo``</p>
- .
- .
- `foo
- .
- <p>`foo</p>
- .
- ## Emphasis and strong emphasis
- John Gruber's original [Markdown syntax
- description](http://daringfireball.net/projects/markdown/syntax#em) says:
- > Markdown treats asterisks (`*`) and underscores (`_`) as indicators of
- > emphasis. Text wrapped with one `*` or `_` will be wrapped with an HTML
- > `<em>` tag; double `*`'s or `_`'s will be wrapped with an HTML `<strong>`
- > tag.
- This is enough for most users, but these rules leave much undecided,
- especially when it comes to nested emphasis. The original
- `Markdown.pl` test suite makes it clear that triple `***` and
- `___` delimiters can be used for strong emphasis, and most
- implementations have also allowed the following patterns:
- ``` markdown
- ***strong emph***
- ***strong** in emph*
- ***emph* in strong**
- **in strong *emph***
- *in emph **strong***
- ```
- The following patterns are less widely supported, but the intent
- is clear and they are useful (especially in contexts like bibliography
- entries):
- ``` markdown
- *emph *with emph* in it*
- **strong **with strong** in it**
- ```
- Many implementations have also restricted intraword emphasis to
- the `*` forms, to avoid unwanted emphasis in words containing
- internal underscores. (It is best practice to put these in code
- spans, but users often do not.)
- ``` markdown
- internal emphasis: foo*bar*baz
- no emphasis: foo_bar_baz
- ```
- The rules given below capture all of these patterns, while allowing
- for efficient parsing strategies that do not backtrack.
- First, some definitions. A [delimiter run](@delimiter-run) is either
- a sequence of one or more `*` characters that is not preceded or
- followed by a `*` character, or a sequence of one or more `_`
- characters that is not preceded or followed by a `_` character.
- A [left-flanking delimiter run](@left-flanking-delimiter-run) is
- a [delimiter run] that is (a) not followed by [unicode whitespace],
- and (b) either not followed by a [punctuation character], or
- preceded by [unicode whitespace] or a [punctuation character].
- For purposes of this definition, the beginning and the end of
- the line count as unicode whitespace.
- A [right-flanking delimiter run](@right-flanking-delimiter-run) is
- a [delimiter run] that is (a) not preceded by [unicode whitespace],
- and (b) either not preceded by a [punctuation character], or
- followed by [unicode whitespace] or a [punctuation character].
- For purposes of this definition, the beginning and the end of
- the line count as unicode whitespace.
- Here are some examples of delimiter runs.
- - left-flanking but not right-flanking:
- ```
- ***abc
- _abc
- **"abc"
- _"abc"
- ```
- - right-flanking but not left-flanking:
- ```
- abc***
- abc_
- "abc"**
- "abc"_
- ```
- - Both left and right-flanking:
- ```
- abc***def
- "abc"_"def"
- ```
- - Neither left nor right-flanking:
- ```
- abc *** def
- a _ b
- ```
- (The idea of distinguishing left-flanking and right-flanking
- delimiter runs based on the character before and the character
- after comes from Roopesh Chander's
- [vfmd](http://www.vfmd.org/vfmd-spec/specification/#procedure-for-identifying-emphasis-tags).
- vfmd uses the terminology "emphasis indicator string" instead of "delimiter
- run," and its rules for distinguishing left- and right-flanking runs
- are a bit more complex than the ones given here.)
- The following rules define emphasis and strong emphasis:
- 1. A single `*` character [can open emphasis](@can-open-emphasis)
- iff (if and only if) it is part of a [left-flanking delimiter run].
- 2. A single `_` character [can open emphasis] iff
- it is part of a [left-flanking delimiter run]
- and either (a) not part of a [right-flanking delimiter run]
- or (b) part of a [right-flanking delimeter run]
- preceded by punctuation.
- 3. A single `*` character [can close emphasis](@can-close-emphasis)
- iff it is part of a [right-flanking delimiter run].
- 4. A single `_` character [can close emphasis] iff
- it is part of a [right-flanking delimiter run]
- and either (a) not part of a [left-flanking delimiter run]
- or (b) part of a [left-flanking delimeter run]
- followed by punctuation.
- 5. A double `**` [can open strong emphasis](@can-open-strong-emphasis)
- iff it is part of a [left-flanking delimiter run].
- 6. A double `__` [can open strong emphasis] iff
- it is part of a [left-flanking delimiter run]
- and either (a) not part of a [right-flanking delimiter run]
- or (b) part of a [right-flanking delimeter run]
- preceded by punctuation.
- 7. A double `**` [can close strong emphasis](@can-close-strong-emphasis)
- iff it is part of a [right-flanking delimiter run].
- 8. A double `__` [can close strong emphasis]
- it is part of a [right-flanking delimiter run]
- and either (a) not part of a [left-flanking delimiter run]
- or (b) part of a [left-flanking delimeter run]
- followed by punctuation.
- 9. Emphasis begins with a delimiter that [can open emphasis] and ends
- with a delimiter that [can close emphasis], and that uses the same
- character (`_` or `*`) as the opening delimiter. There must
- be a nonempty sequence of inlines between the open delimiter
- and the closing delimiter; these form the contents of the emphasis
- inline.
- 10. Strong emphasis begins with a delimiter that
- [can open strong emphasis] and ends with a delimiter that
- [can close strong emphasis], and that uses the same character
- (`_` or `*`) as the opening delimiter.
- There must be a nonempty sequence of inlines between the open
- delimiter and the closing delimiter; these form the contents of
- the strong emphasis inline.
- 11. A literal `*` character cannot occur at the beginning or end of
- `*`-delimited emphasis or `**`-delimited strong emphasis, unless it
- is backslash-escaped.
- 12. A literal `_` character cannot occur at the beginning or end of
- `_`-delimited emphasis or `__`-delimited strong emphasis, unless it
- is backslash-escaped.
- Where rules 1--12 above are compatible with multiple parsings,
- the following principles resolve ambiguity:
- 13. The number of nestings should be minimized. Thus, for example,
- an interpretation `<strong>...</strong>` is always preferred to
- `<em><em>...</em></em>`.
- 14. An interpretation `<strong><em>...</em></strong>` is always
- preferred to `<em><strong>..</strong></em>`.
- 15. When two potential emphasis or strong emphasis spans overlap,
- so that the second begins before the first ends and ends after
- the first ends, the first takes precedence. Thus, for example,
- `*foo _bar* baz_` is parsed as `<em>foo _bar</em> baz_` rather
- than `*foo <em>bar* baz</em>`. For the same reason,
- `**foo*bar**` is parsed as `<em><em>foo</em>bar</em>*`
- rather than `<strong>foo*bar</strong>`.
- 16. When there are two potential emphasis or strong emphasis spans
- with the same closing delimiter, the shorter one (the one that
- opens later) takes precedence. Thus, for example,
- `**foo **bar baz**` is parsed as `**foo <strong>bar baz</strong>`
- rather than `<strong>foo **bar baz</strong>`.
- 17. Inline code spans, links, images, and HTML tags group more tightly
- than emphasis. So, when there is a choice between an interpretation
- that contains one of these elements and one that does not, the
- former always wins. Thus, for example, `*[foo*](bar)` is
- parsed as `*<a href="bar">foo*</a>` rather than as
- `<em>[foo</em>](bar)`.
- These rules can be illustrated through a series of examples.
- Rule 1:
- .
- *foo bar*
- .
- <p><em>foo bar</em></p>
- .
- This is not emphasis, because the opening `*` is followed by
- whitespace, and hence not part of a [left-flanking delimiter run]:
- .
- a * foo bar*
- .
- <p>a * foo bar*</p>
- .
- This is not emphasis, because the opening `*` is preceded
- by an alphanumeric and followed by punctuation, and hence
- not part of a [left-flanking delimiter run]:
- .
- a*"foo"*
- .
- <p>a*"foo"*</p>
- .
- Unicode nonbreaking spaces count as whitespace, too:
- .
- * a *
- .
- <p>* a *</p>
- .
- Intraword emphasis with `*` is permitted:
- .
- foo*bar*
- .
- <p>foo<em>bar</em></p>
- .
- .
- 5*6*78
- .
- <p>5<em>6</em>78</p>
- .
- Rule 2:
- .
- _foo bar_
- .
- <p><em>foo bar</em></p>
- .
- This is not emphasis, because the opening `_` is followed by
- whitespace:
- .
- _ foo bar_
- .
- <p>_ foo bar_</p>
- .
- This is not emphasis, because the opening `_` is preceded
- by an alphanumeric and followed by punctuation:
- .
- a_"foo"_
- .
- <p>a_"foo"_</p>
- .
- Emphasis with `_` is not allowed inside words:
- .
- foo_bar_
- .
- <p>foo_bar_</p>
- .
- .
- 5_6_78
- .
- <p>5_6_78</p>
- .
- .
- пристаням_стремятся_
- .
- <p>пристаням_стремятся_</p>
- .
- Here `_` does not generate emphasis, because the first delimiter run
- is right-flanking and the second left-flanking:
- .
- aa_"bb"_cc
- .
- <p>aa_"bb"_cc</p>
- .
- This is emphasis, even though the opening delimiter is
- both left- and right-flanking, because it is preceded by
- punctuation:
- .
- foo-_(bar)_
- .
- <p>foo-<em>(bar)</em></p>
- .
- Rule 3:
- This is not emphasis, because the closing delimiter does
- not match the opening delimiter:
- .
- _foo*
- .
- <p>_foo*</p>
- .
- This is not emphasis, because the closing `*` is preceded by
- whitespace:
- .
- *foo bar *
- .
- <p>*foo bar *</p>
- .
- A newline also counts as whitespace:
- .
- *foo bar
- *
- .
- <p>*foo bar</p>
- <ul>
- <li></li>
- </ul>
- .
- This is not emphasis, because the second `*` is
- preceded by punctuation and followed by an alphanumeric
- (hence it is not part of a [right-flanking delimiter run]:
- .
- *(*foo)
- .
- <p>*(*foo)</p>
- .
- The point of this restriction is more easily appreciated
- with this example:
- .
- *(*foo*)*
- .
- <p><em>(<em>foo</em>)</em></p>
- .
- Intraword emphasis with `*` is allowed:
- .
- *foo*bar
- .
- <p><em>foo</em>bar</p>
- .
- Rule 4:
- This is not emphasis, because the closing `_` is preceded by
- whitespace:
- .
- _foo bar _
- .
- <p>_foo bar _</p>
- .
- This is not emphasis, because the second `_` is
- preceded by punctuation and followed by an alphanumeric:
- .
- _(_foo)
- .
- <p>_(_foo)</p>
- .
- This is emphasis within emphasis:
- .
- _(_foo_)_
- .
- <p><em>(<em>foo</em>)</em></p>
- .
- Intraword emphasis is disallowed for `_`:
- .
- _foo_bar
- .
- <p>_foo_bar</p>
- .
- .
- _пристаням_стремятся
- .
- <p>_пристаням_стремятся</p>
- .
- .
- _foo_bar_baz_
- .
- <p><em>foo_bar_baz</em></p>
- .
- This is emphasis, even though the closing delimiter is
- both left- and right-flanking, because it is followed by
- punctuation:
- .
- _(bar)_.
- .
- <p><em>(bar)</em>.</p>
- .
- Rule 5:
- .
- **foo bar**
- .
- <p><strong>foo bar</strong></p>
- .
- This is not strong emphasis, because the opening delimiter is
- followed by whitespace:
- .
- ** foo bar**
- .
- <p>** foo bar**</p>
- .
- This is not strong emphasis, because the opening `**` is preceded
- by an alphanumeric and followed by punctuation, and hence
- not part of a [left-flanking delimiter run]:
- .
- a**"foo"**
- .
- <p>a**"foo"**</p>
- .
- Intraword strong emphasis with `**` is permitted:
- .
- foo**bar**
- .
- <p>foo<strong>bar</strong></p>
- .
- Rule 6:
- .
- __foo bar__
- .
- <p><strong>foo bar</strong></p>
- .
- This is not strong emphasis, because the opening delimiter is
- followed by whitespace:
- .
- __ foo bar__
- .
- <p>__ foo bar__</p>
- .
- A newline counts as whitespace:
- .
- __
- foo bar__
- .
- <p>__
- foo bar__</p>
- .
- This is not strong emphasis, because the opening `__` is preceded
- by an alphanumeric and followed by punctuation:
- .
- a__"foo"__
- .
- <p>a__"foo"__</p>
- .
- Intraword strong emphasis is forbidden with `__`:
- .
- foo__bar__
- .
- <p>foo__bar__</p>
- .
- .
- 5__6__78
- .
- <p>5__6__78</p>
- .
- .
- пристаням__стремятся__
- .
- <p>пристаням__стремятся__</p>
- .
- .
- __foo, __bar__, baz__
- .
- <p><strong>foo, <strong>bar</strong>, baz</strong></p>
- .
- This is strong emphasis, even though the opening delimiter is
- both left- and right-flanking, because it is preceded by
- punctuation:
- .
- foo-__(bar)__
- .
- <p>foo-<strong>(bar)</strong></p>
- .
- Rule 7:
- This is not strong emphasis, because the closing delimiter is preceded
- by whitespace:
- .
- **foo bar **
- .
- <p>**foo bar **</p>
- .
- (Nor can it be interpreted as an emphasized `*foo bar *`, because of
- Rule 11.)
- This is not strong emphasis, because the second `**` is
- preceded by punctuation and followed by an alphanumeric:
- .
- **(**foo)
- .
- <p>**(**foo)</p>
- .
- The point of this restriction is more easily appreciated
- with these examples:
- .
- *(**foo**)*
- .
- <p><em>(<strong>foo</strong>)</em></p>
- .
- .
- **Gomphocarpus (*Gomphocarpus physocarpus*, syn.
- *Asclepias physocarpa*)**
- .
- <p><strong>Gomphocarpus (<em>Gomphocarpus physocarpus</em>, syn.
- <em>Asclepias physocarpa</em>)</strong></p>
- .
- .
- **foo "*bar*" foo**
- .
- <p><strong>foo "<em>bar</em>" foo</strong></p>
- .
- Intraword emphasis:
- .
- **foo**bar
- .
- <p><strong>foo</strong>bar</p>
- .
- Rule 8:
- This is not strong emphasis, because the closing delimiter is
- preceded by whitespace:
- .
- __foo bar __
- .
- <p>__foo bar __</p>
- .
- This is not strong emphasis, because the second `__` is
- preceded by punctuation and followed by an alphanumeric:
- .
- __(__foo)
- .
- <p>__(__foo)</p>
- .
- The point of this restriction is more easily appreciated
- with this example:
- .
- _(__foo__)_
- .
- <p><em>(<strong>foo</strong>)</em></p>
- .
- Intraword strong emphasis is forbidden with `__`:
- .
- __foo__bar
- .
- <p>__foo__bar</p>
- .
- .
- __пристаням__стремятся
- .
- <p>__пристаням__стремятся</p>
- .
- .
- __foo__bar__baz__
- .
- <p><strong>foo__bar__baz</strong></p>
- .
- This is strong emphasis, even though the closing delimiter is
- both left- and right-flanking, because it is followed by
- punctuation:
- .
- __(bar)__.
- .
- <p><strong>(bar)</strong>.</p>
- .
- Rule 9:
- Any nonempty sequence of inline elements can be the contents of an
- emphasized span.
- .
- *foo [bar](/url)*
- .
- <p><em>foo <a href="/url">bar</a></em></p>
- .
- .
- *foo
- bar*
- .
- <p><em>foo
- bar</em></p>
- .
- In particular, emphasis and strong emphasis can be nested
- inside emphasis:
- .
- _foo __bar__ baz_
- .
- <p><em>foo <strong>bar</strong> baz</em></p>
- .
- .
- _foo _bar_ baz_
- .
- <p><em>foo <em>bar</em> baz</em></p>
- .
- .
- __foo_ bar_
- .
- <p><em><em>foo</em> bar</em></p>
- .
- .
- *foo *bar**
- .
- <p><em>foo <em>bar</em></em></p>
- .
- .
- *foo **bar** baz*
- .
- <p><em>foo <strong>bar</strong> baz</em></p>
- .
- But note:
- .
- *foo**bar**baz*
- .
- <p><em>foo</em><em>bar</em><em>baz</em></p>
- .
- The difference is that in the preceding case, the internal delimiters
- [can close emphasis], while in the cases with spaces, they cannot.
- .
- ***foo** bar*
- .
- <p><em><strong>foo</strong> bar</em></p>
- .
- .
- *foo **bar***
- .
- <p><em>foo <strong>bar</strong></em></p>
- .
- Note, however, that in the following case we get no strong
- emphasis, because the opening delimiter is closed by the first
- `*` before `bar`:
- .
- *foo**bar***
- .
- <p><em>foo</em><em>bar</em>**</p>
- .
- Indefinite levels of nesting are possible:
- .
- *foo **bar *baz* bim** bop*
- .
- <p><em>foo <strong>bar <em>baz</em> bim</strong> bop</em></p>
- .
- .
- *foo [*bar*](/url)*
- .
- <p><em>foo <a href="/url"><em>bar</em></a></em></p>
- .
- There can be no empty emphasis or strong emphasis:
- .
- ** is not an empty emphasis
- .
- <p>** is not an empty emphasis</p>
- .
- .
- **** is not an empty strong emphasis
- .
- <p>**** is not an empty strong emphasis</p>
- .
- Rule 10:
- Any nonempty sequence of inline elements can be the contents of an
- strongly emphasized span.
- .
- **foo [bar](/url)**
- .
- <p><strong>foo <a href="/url">bar</a></strong></p>
- .
- .
- **foo
- bar**
- .
- <p><strong>foo
- bar</strong></p>
- .
- In particular, emphasis and strong emphasis can be nested
- inside strong emphasis:
- .
- __foo _bar_ baz__
- .
- <p><strong>foo <em>bar</em> baz</strong></p>
- .
- .
- __foo __bar__ baz__
- .
- <p><strong>foo <strong>bar</strong> baz</strong></p>
- .
- .
- ____foo__ bar__
- .
- <p><strong><strong>foo</strong> bar</strong></p>
- .
- .
- **foo **bar****
- .
- <p><strong>foo <strong>bar</strong></strong></p>
- .
- .
- **foo *bar* baz**
- .
- <p><strong>foo <em>bar</em> baz</strong></p>
- .
- But note:
- .
- **foo*bar*baz**
- .
- <p><em><em>foo</em>bar</em>baz**</p>
- .
- The difference is that in the preceding case, the internal delimiters
- [can close emphasis], while in the cases with spaces, they cannot.
- .
- ***foo* bar**
- .
- <p><strong><em>foo</em> bar</strong></p>
- .
- .
- **foo *bar***
- .
- <p><strong>foo <em>bar</em></strong></p>
- .
- Indefinite levels of nesting are possible:
- .
- **foo *bar **baz**
- bim* bop**
- .
- <p><strong>foo <em>bar <strong>baz</strong>
- bim</em> bop</strong></p>
- .
- .
- **foo [*bar*](/url)**
- .
- <p><strong>foo <a href="/url"><em>bar</em></a></strong></p>
- .
- There can be no empty emphasis or strong emphasis:
- .
- __ is not an empty emphasis
- .
- <p>__ is not an empty emphasis</p>
- .
- .
- ____ is not an empty strong emphasis
- .
- <p>____ is not an empty strong emphasis</p>
- .
- Rule 11:
- .
- foo ***
- .
- <p>foo ***</p>
- .
- .
- foo *\**
- .
- <p>foo <em>*</em></p>
- .
- .
- foo *_*
- .
- <p>foo <em>_</em></p>
- .
- .
- foo *****
- .
- <p>foo *****</p>
- .
- .
- foo **\***
- .
- <p>foo <strong>*</strong></p>
- .
- .
- foo **_**
- .
- <p>foo <strong>_</strong></p>
- .
- Note that when delimiters do not match evenly, Rule 11 determines
- that the excess literal `*` characters will appear outside of the
- emphasis, rather than inside it:
- .
- **foo*
- .
- <p>*<em>foo</em></p>
- .
- .
- *foo**
- .
- <p><em>foo</em>*</p>
- .
- .
- ***foo**
- .
- <p>*<strong>foo</strong></p>
- .
- .
- ****foo*
- .
- <p>***<em>foo</em></p>
- .
- .
- **foo***
- .
- <p><strong>foo</strong>*</p>
- .
- .
- *foo****
- .
- <p><em>foo</em>***</p>
- .
- Rule 12:
- .
- foo ___
- .
- <p>foo ___</p>
- .
- .
- foo _\__
- .
- <p>foo <em>_</em></p>
- .
- .
- foo _*_
- .
- <p>foo <em>*</em></p>
- .
- .
- foo _____
- .
- <p>foo _____</p>
- .
- .
- foo __\___
- .
- <p>foo <strong>_</strong></p>
- .
- .
- foo __*__
- .
- <p>foo <strong>*</strong></p>
- .
- .
- __foo_
- .
- <p>_<em>foo</em></p>
- .
- Note that when delimiters do not match evenly, Rule 12 determines
- that the excess literal `_` characters will appear outside of the
- emphasis, rather than inside it:
- .
- _foo__
- .
- <p><em>foo</em>_</p>
- .
- .
- ___foo__
- .
- <p>_<strong>foo</strong></p>
- .
- .
- ____foo_
- .
- <p>___<em>foo</em></p>
- .
- .
- __foo___
- .
- <p><strong>foo</strong>_</p>
- .
- .
- _foo____
- .
- <p><em>foo</em>___</p>
- .
- Rule 13 implies that if you want emphasis nested directly inside
- emphasis, you must use different delimiters:
- .
- **foo**
- .
- <p><strong>foo</strong></p>
- .
- .
- *_foo_*
- .
- <p><em><em>foo</em></em></p>
- .
- .
- __foo__
- .
- <p><strong>foo</strong></p>
- .
- .
- _*foo*_
- .
- <p><em><em>foo</em></em></p>
- .
- However, strong emphasis within strong emphasis is possible without
- switching delimiters:
- .
- ****foo****
- .
- <p><strong><strong>foo</strong></strong></p>
- .
- .
- ____foo____
- .
- <p><strong><strong>foo</strong></strong></p>
- .
- Rule 13 can be applied to arbitrarily long sequences of
- delimiters:
- .
- ******foo******
- .
- <p><strong><strong><strong>foo</strong></strong></strong></p>
- .
- Rule 14:
- .
- ***foo***
- .
- <p><strong><em>foo</em></strong></p>
- .
- .
- _____foo_____
- .
- <p><strong><strong><em>foo</em></strong></strong></p>
- .
- Rule 15:
- .
- *foo _bar* baz_
- .
- <p><em>foo _bar</em> baz_</p>
- .
- .
- **foo*bar**
- .
- <p><em><em>foo</em>bar</em>*</p>
- .
- Rule 16:
- .
- **foo **bar baz**
- .
- <p>**foo <strong>bar baz</strong></p>
- .
- .
- *foo *bar baz*
- .
- <p>*foo <em>bar baz</em></p>
- .
- Rule 17:
- .
- *[bar*](/url)
- .
- <p>*<a href="/url">bar*</a></p>
- .
- .
- _foo [bar_](/url)
- .
- <p>_foo <a href="/url">bar_</a></p>
- .
- .
- *<img src="foo" title="*"/>
- .
- <p>*<img src="foo" title="*"/></p>
- .
- .
- **<a href="**">
- .
- <p>**<a href="**"></p>
- .
- .
- __<a href="__">
- .
- <p>__<a href="__"></p>
- .
- .
- *a `*`*
- .
- <p><em>a <code>*</code></em></p>
- .
- .
- _a `_`_
- .
- <p><em>a <code>_</code></em></p>
- .
- .
- **a<http://foo.bar/?q=**>
- .
- <p>**a<a href="http://foo.bar/?q=**">http://foo.bar/?q=**</a></p>
- .
- .
- __a<http://foo.bar/?q=__>
- .
- <p>__a<a href="http://foo.bar/?q=__">http://foo.bar/?q=__</a></p>
- .
- ## Links
- A link contains [link text] (the visible text), a [link destination]
- (the URI that is the link destination), and optionally a [link title].
- There are two basic kinds of links in Markdown. In [inline link]s the
- destination and title are given immediately after the link text. In
- [reference link]s the destination and title are defined elsewhere in
- the document.
- A [link text](@link-text) consists of a sequence of zero or more
- inline elements enclosed by square brackets (`[` and `]`). The
- following rules apply:
- - Links may not contain other links, at any level of nesting.
- - Brackets are allowed in the [link text] only if (a) they
- are backslash-escaped or (b) they appear as a matched pair of brackets,
- with an open bracket `[`, a sequence of zero or more inlines, and
- a close bracket `]`.
- - Backtick [code span]s, [autolink]s, and raw [HTML tag]s bind more tightly
- than the brackets in link text. Thus, for example,
- `` [foo`]` `` could not be a link text, since the second `]`
- is part of a code span.
- - The brackets in link text bind more tightly than markers for
- [emphasis and strong emphasis]. Thus, for example, `*[foo*](url)` is a link.
- A [link destination](@link-destination) consists of either
- - a sequence of zero or more characters between an opening `<` and a
- closing `>` that contains no line breaks or unescaped `<` or `>`
- characters, or
- - a nonempty sequence of characters that does not include
- ASCII space or control characters, and includes parentheses
- only if (a) they are backslash-escaped or (b) they are part of
- a balanced pair of unescaped parentheses that is not itself
- inside a balanced pair of unescaped parentheses.
- A [link title](@link-title) consists of either
- - a sequence of zero or more characters between straight double-quote
- characters (`"`), including a `"` character only if it is
- backslash-escaped, or
- - a sequence of zero or more characters between straight single-quote
- characters (`'`), including a `'` character only if it is
- backslash-escaped, or
- - a sequence of zero or more characters between matching parentheses
- (`(...)`), including a `)` character only if it is backslash-escaped.
- Although [link title]s may span multiple lines, they may not contain
- a [blank line].
- An [inline link](@inline-link) consists of a [link text] followed immediately
- by a left parenthesis `(`, optional [whitespace], an optional
- [link destination], an optional [link title] separated from the link
- destination by [whitespace], optional [whitespace], and a right
- parenthesis `)`. The link's text consists of the inlines contained
- in the [link text] (excluding the enclosing square brackets).
- The link's URI consists of the link destination, excluding enclosing
- `<...>` if present, with backslash-escapes in effect as described
- above. The link's title consists of the link title, excluding its
- enclosing delimiters, with backslash-escapes in effect as described
- above.
- Here is a simple inline link:
- .
- [link](/uri "title")
- .
- <p><a href="/uri" title="title">link</a></p>
- .
- The title may be omitted:
- .
- [link](/uri)
- .
- <p><a href="/uri">link</a></p>
- .
- Both the title and the destination may be omitted:
- .
- [link]()
- .
- <p><a href="">link</a></p>
- .
- .
- [link](<>)
- .
- <p><a href="">link</a></p>
- .
- If the destination contains spaces, it must be enclosed in pointy
- braces:
- .
- [link](/my uri)
- .
- <p>[link](/my uri)</p>
- .
- .
- [link](</my uri>)
- .
- <p><a href="/my%20uri">link</a></p>
- .
- The destination cannot contain line breaks, even with pointy braces:
- .
- [link](foo
- bar)
- .
- <p>[link](foo
- bar)</p>
- .
- .
- [link](<foo
- bar>)
- .
- <p>[link](<foo
- bar>)</p>
- .
- One level of balanced parentheses is allowed without escaping:
- .
- [link]((foo)and(bar))
- .
- <p><a href="(foo)and(bar)">link</a></p>
- .
- However, if you have parentheses within parentheses, you need to escape
- or use the `<...>` form:
- .
- [link](foo(and(bar)))
- .
- <p>[link](foo(and(bar)))</p>
- .
- .
- [link](foo(and\(bar\)))
- .
- <p><a href="foo(and(bar))">link</a></p>
- .
- .
- [link](<foo(and(bar))>)
- .
- <p><a href="foo(and(bar))">link</a></p>
- .
- Parentheses and other symbols can also be escaped, as usual
- in Markdown:
- .
- [link](foo\)\:)
- .
- <p><a href="foo):">link</a></p>
- .
- URL-escaping should be left alone inside the destination, as all
- URL-escaped characters are also valid URL characters. HTML entities in
- the destination will be parsed into the corresponding unicode
- codepoints, as usual, and optionally URL-escaped when written as HTML.
- .
- [link](foo%20bä)
- .
- <p><a href="foo%20b%C3%A4">link</a></p>
- .
- Note that, because titles can often be parsed as destinations,
- if you try to omit the destination and keep the title, you'll
- get unexpected results:
- .
- [link]("title")
- .
- <p><a href="%22title%22">link</a></p>
- .
- Titles may be in single quotes, double quotes, or parentheses:
- .
- [link](/url "title")
- [link](/url 'title')
- [link](/url (title))
- .
- <p><a href="/url" title="title">link</a>
- <a href="/url" title="title">link</a>
- <a href="/url" title="title">link</a></p>
- .
- Backslash escapes and entities may be used in titles:
- .
- [link](/url "title \""")
- .
- <p><a href="/url" title="title """>link</a></p>
- .
- Nested balanced quotes are not allowed without escaping:
- .
- [link](/url "title "and" title")
- .
- <p>[link](/url "title "and" title")</p>
- .
- But it is easy to work around this by using a different quote type:
- .
- [link](/url 'title "and" title')
- .
- <p><a href="/url" title="title "and" title">link</a></p>
- .
- (Note: `Markdown.pl` did allow double quotes inside a double-quoted
- title, and its test suite included a test demonstrating this.
- But it is hard to see a good rationale for the extra complexity this
- brings, since there are already many ways---backslash escaping,
- entities, or using a different quote type for the enclosing title---to
- write titles containing double quotes. `Markdown.pl`'s handling of
- titles has a number of other strange features. For example, it allows
- single-quoted titles in inline links, but not reference links. And, in
- reference links but not inline links, it allows a title to begin with
- `"` and end with `)`. `Markdown.pl` 1.0.1 even allows titles with no closing
- quotation mark, though 1.0.2b8 does not. It seems preferable to adopt
- a simple, rational rule that works the same way in inline links and
- link reference definitions.)
- [Whitespace] is allowed around the destination and title:
- .
- [link]( /uri
- "title" )
- .
- <p><a href="/uri" title="title">link</a></p>
- .
- But it is not allowed between the link text and the
- following parenthesis:
- .
- [link] (/uri)
- .
- <p>[link] (/uri)</p>
- .
- The link text may contain balanced brackets, but not unbalanced ones,
- unless they are escaped:
- .
- [link [foo [bar]]](/uri)
- .
- <p><a href="/uri">link [foo [bar]]</a></p>
- .
- .
- [link] bar](/uri)
- .
- <p>[link] bar](/uri)</p>
- .
- .
- [link [bar](/uri)
- .
- <p>[link <a href="/uri">bar</a></p>
- .
- .
- [link \[bar](/uri)
- .
- <p><a href="/uri">link [bar</a></p>
- .
- The link text may contain inline content:
- .
- [link *foo **bar** `#`*](/uri)
- .
- <p><a href="/uri">link <em>foo <strong>bar</strong> <code>#</code></em></a></p>
- .
- .
- [](/uri)
- .
- <p><a href="/uri"><img src="moon.jpg" alt="moon" /></a></p>
- .
- However, links may not contain other links, at any level of nesting.
- .
- [foo [bar](/uri)](/uri)
- .
- <p>[foo <a href="/uri">bar</a>](/uri)</p>
- .
- .
- [foo *[bar [baz](/uri)](/uri)*](/uri)
- .
- <p>[foo <em>[bar <a href="/uri">baz</a>](/uri)</em>](/uri)</p>
- .
- .
- ](uri2)](uri3)
- .
- <p><img src="uri3" alt="[foo](uri2)" /></p>
- .
- These cases illustrate the precedence of link text grouping over
- emphasis grouping:
- .
- *[foo*](/uri)
- .
- <p>*<a href="/uri">foo*</a></p>
- .
- .
- [foo *bar](baz*)
- .
- <p><a href="baz*">foo *bar</a></p>
- .
- Note that brackets that *aren't* part of links do not take
- precedence:
- .
- *foo [bar* baz]
- .
- <p><em>foo [bar</em> baz]</p>
- .
- These cases illustrate the precedence of HTML tags, code spans,
- and autolinks over link grouping:
- .
- [foo <bar attr="](baz)">
- .
- <p>[foo <bar attr="](baz)"></p>
- .
- .
- [foo`](/uri)`
- .
- <p>[foo<code>](/uri)</code></p>
- .
- .
- [foo<http://example.com/?search=](uri)>
- .
- <p>[foo<a href="http://example.com/?search=%5D(uri)">http://example.com/?search=](uri)</a></p>
- .
- There are three kinds of [reference link](@reference-link)s:
- [full](#full-reference-link), [collapsed](#collapsed-reference-link),
- and [shortcut](#shortcut-reference-link).
- A [full reference link](@full-reference-link)
- consists of a [link text], optional [whitespace], and a [link label]
- that [matches] a [link reference definition] elsewhere in the document.
- A [link label](@link-label) begins with a left bracket (`[`) and ends
- with the first right bracket (`]`) that is not backslash-escaped.
- Between these brackets there must be at least one non-[whitespace character].
- Unescaped square bracket characters are not allowed in
- [link label]s. A link label can have at most 999
- characters inside the square brackets.
- One label [matches](@matches)
- another just in case their normalized forms are equal. To normalize a
- label, perform the *unicode case fold* and collapse consecutive internal
- [whitespace] to a single space. If there are multiple
- matching reference link definitions, the one that comes first in the
- document is used. (It is desirable in such cases to emit a warning.)
- The contents of the first link label are parsed as inlines, which are
- used as the link's text. The link's URI and title are provided by the
- matching [link reference definition].
- Here is a simple example:
- .
- [foo][bar]
- [bar]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- The rules for the [link text] are the same as with
- [inline link]s. Thus:
- The link text may contain balanced brackets, but not unbalanced ones,
- unless they are escaped:
- .
- [link [foo [bar]]][ref]
- [ref]: /uri
- .
- <p><a href="/uri">link [foo [bar]]</a></p>
- .
- .
- [link \[bar][ref]
- [ref]: /uri
- .
- <p><a href="/uri">link [bar</a></p>
- .
- The link text may contain inline content:
- .
- [link *foo **bar** `#`*][ref]
- [ref]: /uri
- .
- <p><a href="/uri">link <em>foo <strong>bar</strong> <code>#</code></em></a></p>
- .
- .
- [][ref]
- [ref]: /uri
- .
- <p><a href="/uri"><img src="moon.jpg" alt="moon" /></a></p>
- .
- However, links may not contain other links, at any level of nesting.
- .
- [foo [bar](/uri)][ref]
- [ref]: /uri
- .
- <p>[foo <a href="/uri">bar</a>]<a href="/uri">ref</a></p>
- .
- .
- [foo *bar [baz][ref]*][ref]
- [ref]: /uri
- .
- <p>[foo <em>bar <a href="/uri">baz</a></em>]<a href="/uri">ref</a></p>
- .
- (In the examples above, we have two [shortcut reference link]s
- instead of one [full reference link].)
- The following cases illustrate the precedence of link text grouping over
- emphasis grouping:
- .
- *[foo*][ref]
- [ref]: /uri
- .
- <p>*<a href="/uri">foo*</a></p>
- .
- .
- [foo *bar][ref]
- [ref]: /uri
- .
- <p><a href="/uri">foo *bar</a></p>
- .
- These cases illustrate the precedence of HTML tags, code spans,
- and autolinks over link grouping:
- .
- [foo <bar attr="][ref]">
- [ref]: /uri
- .
- <p>[foo <bar attr="][ref]"></p>
- .
- .
- [foo`][ref]`
- [ref]: /uri
- .
- <p>[foo<code>][ref]</code></p>
- .
- .
- [foo<http://example.com/?search=][ref]>
- [ref]: /uri
- .
- <p>[foo<a href="http://example.com/?search=%5D%5Bref%5D">http://example.com/?search=][ref]</a></p>
- .
- Matching is case-insensitive:
- .
- [foo][BaR]
- [bar]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- Unicode case fold is used:
- .
- [Толпой][Толпой] is a Russian word.
- [ТОЛПОЙ]: /url
- .
- <p><a href="/url">Толпой</a> is a Russian word.</p>
- .
- Consecutive internal [whitespace] is treated as one space for
- purposes of determining matching:
- .
- [Foo
- bar]: /url
- [Baz][Foo bar]
- .
- <p><a href="/url">Baz</a></p>
- .
- There can be [whitespace] between the [link text] and the [link label]:
- .
- [foo] [bar]
- [bar]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- .
- [foo]
- [bar]
- [bar]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- When there are multiple matching [link reference definition]s,
- the first is used:
- .
- [foo]: /url1
- [foo]: /url2
- [bar][foo]
- .
- <p><a href="/url1">bar</a></p>
- .
- Note that matching is performed on normalized strings, not parsed
- inline content. So the following does not match, even though the
- labels define equivalent inline content:
- .
- [bar][foo\!]
- [foo!]: /url
- .
- <p>[bar][foo!]</p>
- .
- [Link label]s cannot contain brackets, unless they are
- backslash-escaped:
- .
- [foo][ref[]
- [ref[]: /uri
- .
- <p>[foo][ref[]</p>
- <p>[ref[]: /uri</p>
- .
- .
- [foo][ref[bar]]
- [ref[bar]]: /uri
- .
- <p>[foo][ref[bar]]</p>
- <p>[ref[bar]]: /uri</p>
- .
- .
- [[[foo]]]
- [[[foo]]]: /url
- .
- <p>[[[foo]]]</p>
- <p>[[[foo]]]: /url</p>
- .
- .
- [foo][ref\[]
- [ref\[]: /uri
- .
- <p><a href="/uri">foo</a></p>
- .
- A [link label] must contain at least one non-[whitespace character]:
- .
- []
- []: /uri
- .
- <p>[]</p>
- <p>[]: /uri</p>
- .
- .
- [
- ]
- [
- ]: /uri
- .
- <p>[
- ]</p>
- <p>[
- ]: /uri</p>
- .
- A [collapsed reference link](@collapsed-reference-link)
- consists of a [link label] that [matches] a
- [link reference definition] elsewhere in the
- document, optional [whitespace], and the string `[]`.
- The contents of the first link label are parsed as inlines,
- which are used as the link's text. The link's URI and title are
- provided by the matching reference link definition. Thus,
- `[foo][]` is equivalent to `[foo][foo]`.
- .
- [foo][]
- [foo]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- .
- [*foo* bar][]
- [*foo* bar]: /url "title"
- .
- <p><a href="/url" title="title"><em>foo</em> bar</a></p>
- .
- The link labels are case-insensitive:
- .
- [Foo][]
- [foo]: /url "title"
- .
- <p><a href="/url" title="title">Foo</a></p>
- .
- As with full reference links, [whitespace] is allowed
- between the two sets of brackets:
- .
- [foo]
- []
- [foo]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- A [shortcut reference link](@shortcut-reference-link)
- consists of a [link label] that [matches] a
- [link reference definition] elsewhere in the
- document and is not followed by `[]` or a link label.
- The contents of the first link label are parsed as inlines,
- which are used as the link's text. the link's URI and title
- are provided by the matching link reference definition.
- Thus, `[foo]` is equivalent to `[foo][]`.
- .
- [foo]
- [foo]: /url "title"
- .
- <p><a href="/url" title="title">foo</a></p>
- .
- .
- [*foo* bar]
- [*foo* bar]: /url "title"
- .
- <p><a href="/url" title="title"><em>foo</em> bar</a></p>
- .
- .
- [[*foo* bar]]
- [*foo* bar]: /url "title"
- .
- <p>[<a href="/url" title="title"><em>foo</em> bar</a>]</p>
- .
- .
- [[bar [foo]
- [foo]: /url
- .
- <p>[[bar <a href="/url">foo</a></p>
- .
- The link labels are case-insensitive:
- .
- [Foo]
- [foo]: /url "title"
- .
- <p><a href="/url" title="title">Foo</a></p>
- .
- A space after the link text should be preserved:
- .
- [foo] bar
- [foo]: /url
- .
- <p><a href="/url">foo</a> bar</p>
- .
- If you just want bracketed text, you can backslash-escape the
- opening bracket to avoid links:
- .
- \[foo]
- [foo]: /url "title"
- .
- <p>[foo]</p>
- .
- Note that this is a link, because a link label ends with the first
- following closing bracket:
- .
- [foo*]: /url
- *[foo*]
- .
- <p>*<a href="/url">foo*</a></p>
- .
- Full references take precedence over shortcut references:
- .
- [foo][bar]
- [foo]: /url1
- [bar]: /url2
- .
- <p><a href="/url2">foo</a></p>
- .
- In the following case `[bar][baz]` is parsed as a reference,
- `[foo]` as normal text:
- .
- [foo][bar][baz]
- [baz]: /url
- .
- <p>[foo]<a href="/url">bar</a></p>
- .
- Here, though, `[foo][bar]` is parsed as a reference, since
- `[bar]` is defined:
- .
- [foo][bar][baz]
- [baz]: /url1
- [bar]: /url2
- .
- <p><a href="/url2">foo</a><a href="/url1">baz</a></p>
- .
- Here `[foo]` is not parsed as a shortcut reference, because it
- is followed by a link label (even though `[bar]` is not defined):
- .
- [foo][bar][baz]
- [baz]: /url1
- [foo]: /url2
- .
- <p>[foo]<a href="/url1">bar</a></p>
- .
- ## Images
- Syntax for images is like the syntax for links, with one
- difference. Instead of [link text], we have an
- [image description](@image-description). The rules for this are the
- same as for [link text], except that (a) an
- image description starts with `
- .
- <p><img src="/url" alt="foo" title="title" /></p>
- .
- .
- ![foo *bar*]
- [foo *bar*]: train.jpg "train & tracks"
- .
- <p><img src="train.jpg" alt="foo bar" title="train & tracks" /></p>
- .
- .
- ](/url2)
- .
- <p><img src="/url2" alt="foo bar" /></p>
- .
- .
- ](/url2)
- .
- <p><img src="/url2" alt="foo bar" /></p>
- .
- Though this spec is concerned with parsing, not rendering, it is
- recommended that in rendering to HTML, only the plain string content
- of the [image description] be used. Note that in
- the above example, the alt attribute's value is `foo bar`, not `foo
- [bar](/url)` or `foo <a href="/url">bar</a>`. Only the plain string
- content is rendered, without formatting.
- .
- ![foo *bar*][]
- [foo *bar*]: train.jpg "train & tracks"
- .
- <p><img src="train.jpg" alt="foo bar" title="train & tracks" /></p>
- .
- .
- ![foo *bar*][foobar]
- [FOOBAR]: train.jpg "train & tracks"
- .
- <p><img src="train.jpg" alt="foo bar" title="train & tracks" /></p>
- .
- .
- 
- .
- <p><img src="train.jpg" alt="foo" /></p>
- .
- .
- My 
- .
- <p>My <img src="/path/to/train.jpg" alt="foo bar" title="title" /></p>
- .
- .
- 
- .
- <p><img src="url" alt="foo" /></p>
- .
- .
- 
- .
- <p><img src="/url" alt="" /></p>
- .
- Reference-style:
- .
- ![foo] [bar]
- [bar]: /url
- .
- <p><img src="/url" alt="foo" /></p>
- .
- .
- ![foo] [bar]
- [BAR]: /url
- .
- <p><img src="/url" alt="foo" /></p>
- .
- Collapsed:
- .
- ![foo][]
- [foo]: /url "title"
- .
- <p><img src="/url" alt="foo" title="title" /></p>
- .
- .
- ![*foo* bar][]
- [*foo* bar]: /url "title"
- .
- <p><img src="/url" alt="foo bar" title="title" /></p>
- .
- The labels are case-insensitive:
- .
- ![Foo][]
- [foo]: /url "title"
- .
- <p><img src="/url" alt="Foo" title="title" /></p>
- .
- As with full reference links, [whitespace] is allowed
- between the two sets of brackets:
- .
- ![foo]
- []
- [foo]: /url "title"
- .
- <p><img src="/url" alt="foo" title="title" /></p>
- .
- Shortcut:
- .
- ![foo]
- [foo]: /url "title"
- .
- <p><img src="/url" alt="foo" title="title" /></p>
- .
- .
- ![*foo* bar]
- [*foo* bar]: /url "title"
- .
- <p><img src="/url" alt="foo bar" title="title" /></p>
- .
- Note that link labels cannot contain unescaped brackets:
- .
- ![[foo]]
- [[foo]]: /url "title"
- .
- <p>![[foo]]</p>
- <p>[[foo]]: /url "title"</p>
- .
- The link labels are case-insensitive:
- .
- ![Foo]
- [foo]: /url "title"
- .
- <p><img src="/url" alt="Foo" title="title" /></p>
- .
- If you just want bracketed text, you can backslash-escape the
- opening `!` and `[`:
- .
- \!\[foo]
- [foo]: /url "title"
- .
- <p>![foo]</p>
- .
- If you want a link after a literal `!`, backslash-escape the
- `!`:
- .
- \![foo]
- [foo]: /url "title"
- .
- <p>!<a href="/url" title="title">foo</a></p>
- .
- ## Autolinks
- [Autolink](@autolink)s are absolute URIs and email addresses inside
- `<` and `>`. They are parsed as links, with the URL or email address
- as the link label.
- A [URI autolink](@uri-autolink) consists of `<`, followed by an
- [absolute URI] not containing `<`, followed by `>`. It is parsed as
- a link to the URI, with the URI as the link's label.
- An [absolute URI](@absolute-uri),
- for these purposes, consists of a [scheme] followed by a colon (`:`)
- followed by zero or more characters other than ASCII
- [whitespace] and control characters, `<`, and `>`. If
- the URI includes these characters, you must use percent-encoding
- (e.g. `%20` for a space).
- The following [schemes](@scheme)
- are recognized (case-insensitive):
- `coap`, `doi`, `javascript`, `aaa`, `aaas`, `about`, `acap`, `cap`,
- `cid`, `crid`, `data`, `dav`, `dict`, `dns`, `file`, `ftp`, `geo`, `go`,
- `gopher`, `h323`, `http`, `https`, `iax`, `icap`, `im`, `imap`, `info`,
- `ipp`, `iris`, `iris.beep`, `iris.xpc`, `iris.xpcs`, `iris.lwz`, `ldap`,
- `mailto`, `mid`, `msrp`, `msrps`, `mtqp`, `mupdate`, `news`, `nfs`,
- `ni`, `nih`, `nntp`, `opaquelocktoken`, `pop`, `pres`, `rtsp`,
- `service`, `session`, `shttp`, `sieve`, `sip`, `sips`, `sms`, `snmp`,`
- soap.beep`, `soap.beeps`, `tag`, `tel`, `telnet`, `tftp`, `thismessage`,
- `tn3270`, `tip`, `tv`, `urn`, `vemmi`, `ws`, `wss`, `xcon`,
- `xcon-userid`, `xmlrpc.beep`, `xmlrpc.beeps`, `xmpp`, `z39.50r`,
- `z39.50s`, `adiumxtra`, `afp`, `afs`, `aim`, `apt`,` attachment`, `aw`,
- `beshare`, `bitcoin`, `bolo`, `callto`, `chrome`,` chrome-extension`,
- `com-eventbrite-attendee`, `content`, `cvs`,` dlna-playsingle`,
- `dlna-playcontainer`, `dtn`, `dvb`, `ed2k`, `facetime`, `feed`,
- `finger`, `fish`, `gg`, `git`, `gizmoproject`, `gtalk`, `hcp`, `icon`,
- `ipn`, `irc`, `irc6`, `ircs`, `itms`, `jar`, `jms`, `keyparc`, `lastfm`,
- `ldaps`, `magnet`, `maps`, `market`,` message`, `mms`, `ms-help`,
- `msnim`, `mumble`, `mvn`, `notes`, `oid`, `palm`, `paparazzi`,
- `platform`, `proxy`, `psyc`, `query`, `res`, `resource`, `rmi`, `rsync`,
- `rtmp`, `secondlife`, `sftp`, `sgn`, `skype`, `smb`, `soldat`,
- `spotify`, `ssh`, `steam`, `svn`, `teamspeak`, `things`, `udp`,
- `unreal`, `ut2004`, `ventrilo`, `view-source`, `webcal`, `wtai`,
- `wyciwyg`, `xfire`, `xri`, `ymsgr`.
- Here are some valid autolinks:
- .
- <http://foo.bar.baz>
- .
- <p><a href="http://foo.bar.baz">http://foo.bar.baz</a></p>
- .
- .
- <http://foo.bar.baz/test?q=hello&id=22&boolean>
- .
- <p><a href="http://foo.bar.baz/test?q=hello&id=22&boolean">http://foo.bar.baz/test?q=hello&id=22&boolean</a></p>
- .
- .
- <irc://foo.bar:2233/baz>
- .
- <p><a href="irc://foo.bar:2233/baz">irc://foo.bar:2233/baz</a></p>
- .
- Uppercase is also fine:
- .
- <MAILTO:FOO@BAR.BAZ>
- .
- <p><a href="MAILTO:FOO@BAR.BAZ">MAILTO:FOO@BAR.BAZ</a></p>
- .
- Spaces are not allowed in autolinks:
- .
- <http://foo.bar/baz bim>
- .
- <p><http://foo.bar/baz bim></p>
- .
- Backslash-escapes do not work inside autolinks:
- .
- <http://example.com/\[\>
- .
- <p><a href="http://example.com/%5C%5B%5C">http://example.com/\[\</a></p>
- .
- An [email autolink](@email-autolink)
- consists of `<`, followed by an [email address],
- followed by `>`. The link's label is the email address,
- and the URL is `mailto:` followed by the email address.
- An [email address](@email-address),
- for these purposes, is anything that matches
- the [non-normative regex from the HTML5
- spec](https://html.spec.whatwg.org/multipage/forms.html#e-mail-state-(type=email)):
- /^[a-zA-Z0-9.!#$%&'*+/=?^_`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?
- (?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?)*$/
- Examples of email autolinks:
- .
- <foo@bar.example.com>
- .
- <p><a href="mailto:foo@bar.example.com">foo@bar.example.com</a></p>
- .
- .
- <foo+special@Bar.baz-bar0.com>
- .
- <p><a href="mailto:foo+special@Bar.baz-bar0.com">foo+special@Bar.baz-bar0.com</a></p>
- .
- Backslash-escapes do not work inside email autolinks:
- .
- <foo\+@bar.example.com>
- .
- <p><foo+@bar.example.com></p>
- .
- These are not autolinks:
- .
- <>
- .
- <p><></p>
- .
- .
- <heck://bing.bong>
- .
- <p><heck://bing.bong></p>
- .
- .
- < http://foo.bar >
- .
- <p>< http://foo.bar ></p>
- .
- .
- <foo.bar.baz>
- .
- <p><foo.bar.baz></p>
- .
- .
- <localhost:5001/foo>
- .
- <p><localhost:5001/foo></p>
- .
- .
- http://example.com
- .
- <p>http://example.com</p>
- .
- .
- foo@bar.example.com
- .
- <p>foo@bar.example.com</p>
- .
- ## Raw HTML
- Text between `<` and `>` that looks like an HTML tag is parsed as a
- raw HTML tag and will be rendered in HTML without escaping.
- Tag and attribute names are not limited to current HTML tags,
- so custom tags (and even, say, DocBook tags) may be used.
- Here is the grammar for tags:
- A [tag name](@tag-name) consists of an ASCII letter
- followed by zero or more ASCII letters or digits.
- An [attribute](@attribute) consists of [whitespace],
- an [attribute name], and an optional
- [attribute value specification].
- An [attribute name](@attribute-name)
- consists of an ASCII letter, `_`, or `:`, followed by zero or more ASCII
- letters, digits, `_`, `.`, `:`, or `-`. (Note: This is the XML
- specification restricted to ASCII. HTML5 is laxer.)
- An [attribute value specification](@attribute-value-specification)
- consists of optional [whitespace],
- a `=` character, optional [whitespace], and an [attribute
- value].
- An [attribute value](@attribute-value)
- consists of an [unquoted attribute value],
- a [single-quoted attribute value], or a [double-quoted attribute value].
- An [unquoted attribute value](@unquoted-attribute-value)
- is a nonempty string of characters not
- including spaces, `"`, `'`, `=`, `<`, `>`, or `` ` ``.
- A [single-quoted attribute value](@single-quoted-attribute-value)
- consists of `'`, zero or more
- characters not including `'`, and a final `'`.
- A [double-quoted attribute value](@double-quoted-attribute-value)
- consists of `"`, zero or more
- characters not including `"`, and a final `"`.
- An [open tag](@open-tag) consists of a `<` character, a [tag name],
- zero or more [attributes], optional [whitespace], an optional `/`
- character, and a `>` character.
- A [closing tag](@closing-tag) consists of the string `</`, a
- [tag name], optional [whitespace], and the character `>`.
- An [HTML comment](@html-comment) consists of `<!--` + *text* + `-->`,
- where *text* does not start with `>` or `->`, does not end with `-`,
- and does not contain `--`. (See the
- [HTML5 spec](http://www.w3.org/TR/html5/syntax.html#comments).)
- A [processing instruction](@processing-instruction)
- consists of the string `<?`, a string
- of characters not including the string `?>`, and the string
- `?>`.
- A [declaration](@declaration) consists of the
- string `<!`, a name consisting of one or more uppercase ASCII letters,
- [whitespace], a string of characters not including the
- character `>`, and the character `>`.
- A [CDATA section](@cdata-section) consists of
- the string `<![CDATA[`, a string of characters not including the string
- `]]>`, and the string `]]>`.
- An [HTML tag](@html-tag) consists of an [open tag], a [closing tag],
- an [HTML comment], a [processing instruction], a [declaration],
- or a [CDATA section].
- Here are some simple open tags:
- .
- <a><bab><c2c>
- .
- <p><a><bab><c2c></p>
- .
- Empty elements:
- .
- <a/><b2/>
- .
- <p><a/><b2/></p>
- .
- [Whitespace] is allowed:
- .
- <a /><b2
- data="foo" >
- .
- <p><a /><b2
- data="foo" ></p>
- .
- With attributes:
- .
- <a foo="bar" bam = 'baz <em>"</em>'
- _boolean zoop:33=zoop:33 />
- .
- <p><a foo="bar" bam = 'baz <em>"</em>'
- _boolean zoop:33=zoop:33 /></p>
- .
- Illegal tag names, not parsed as HTML:
- .
- <33> <__>
- .
- <p><33> <__></p>
- .
- Illegal attribute names:
- .
- <a h*#ref="hi">
- .
- <p><a h*#ref="hi"></p>
- .
- Illegal attribute values:
- .
- <a href="hi'> <a href=hi'>
- .
- <p><a href="hi'> <a href=hi'></p>
- .
- Illegal [whitespace]:
- .
- < a><
- foo><bar/ >
- .
- <p>< a><
- foo><bar/ ></p>
- .
- Missing [whitespace]:
- .
- <a href='bar'title=title>
- .
- <p><a href='bar'title=title></p>
- .
- Closing tags:
- .
- </a>
- </foo >
- .
- <p></a>
- </foo ></p>
- .
- Illegal attributes in closing tag:
- .
- </a href="foo">
- .
- <p></a href="foo"></p>
- .
- Comments:
- .
- foo <!-- this is a
- comment - with hyphen -->
- .
- <p>foo <!-- this is a
- comment - with hyphen --></p>
- .
- .
- foo <!-- not a comment -- two hyphens -->
- .
- <p>foo <!-- not a comment -- two hyphens --></p>
- .
- Not comments:
- .
- foo <!--> foo -->
- foo <!-- foo--->
- .
- <p>foo <!--> foo --></p>
- <p>foo <!-- foo---></p>
- .
- Processing instructions:
- .
- foo <?php echo $a; ?>
- .
- <p>foo <?php echo $a; ?></p>
- .
- Declarations:
- .
- foo <!ELEMENT br EMPTY>
- .
- <p>foo <!ELEMENT br EMPTY></p>
- .
- CDATA sections:
- .
- foo <![CDATA[>&<]]>
- .
- <p>foo <![CDATA[>&<]]></p>
- .
- Entities are preserved in HTML attributes:
- .
- <a href="ö">
- .
- <p><a href="ö"></p>
- .
- Backslash escapes do not work in HTML attributes:
- .
- <a href="\*">
- .
- <p><a href="\*"></p>
- .
- .
- <a href="\"">
- .
- <p><a href="""></p>
- .
- ## Hard line breaks
- A line break (not in a code span or HTML tag) that is preceded
- by two or more spaces and does not occur at the end of a block
- is parsed as a [hard line break](@hard-line-break) (rendered
- in HTML as a `<br />` tag):
- .
- foo
- baz
- .
- <p>foo<br />
- baz</p>
- .
- For a more visible alternative, a backslash before the
- [line ending] may be used instead of two spaces:
- .
- foo\
- baz
- .
- <p>foo<br />
- baz</p>
- .
- More than two spaces can be used:
- .
- foo
- baz
- .
- <p>foo<br />
- baz</p>
- .
- Leading spaces at the beginning of the next line are ignored:
- .
- foo
- bar
- .
- <p>foo<br />
- bar</p>
- .
- .
- foo\
- bar
- .
- <p>foo<br />
- bar</p>
- .
- Line breaks can occur inside emphasis, links, and other constructs
- that allow inline content:
- .
- *foo
- bar*
- .
- <p><em>foo<br />
- bar</em></p>
- .
- .
- *foo\
- bar*
- .
- <p><em>foo<br />
- bar</em></p>
- .
- Line breaks do not occur inside code spans
- .
- `code
- span`
- .
- <p><code>code span</code></p>
- .
- .
- `code\
- span`
- .
- <p><code>code\ span</code></p>
- .
- or HTML tags:
- .
- <a href="foo
- bar">
- .
- <p><a href="foo
- bar"></p>
- .
- .
- <a href="foo\
- bar">
- .
- <p><a href="foo\
- bar"></p>
- .
- Hard line breaks are for separating inline content within a block.
- Neither syntax for hard line breaks works at the end of a paragraph or
- other block element:
- .
- foo\
- .
- <p>foo\</p>
- .
- .
- foo
- .
- <p>foo</p>
- .
- .
- ### foo\
- .
- <h3>foo\</h3>
- .
- .
- ### foo
- .
- <h3>foo</h3>
- .
- ## Soft line breaks
- A regular line break (not in a code span or HTML tag) that is not
- preceded by two or more spaces or a backslash is parsed as a
- softbreak. (A softbreak may be rendered in HTML either as a
- [line ending] or as a space. The result will be the same in
- browsers. In the examples here, a [line ending] will be used.)
- .
- foo
- baz
- .
- <p>foo
- baz</p>
- .
- Spaces at the end of the line and beginning of the next line are
- removed:
- .
- foo
- baz
- .
- <p>foo
- baz</p>
- .
- A conforming parser may render a soft line break in HTML either as a
- line break or as a space.
- A renderer may also provide an option to render soft line breaks
- as hard line breaks.
- ## Textual content
- Any characters not given an interpretation by the above rules will
- be parsed as plain textual content.
- .
- hello $.;'there
- .
- <p>hello $.;'there</p>
- .
- .
- Foo χρῆν
- .
- <p>Foo χρῆν</p>
- .
- Internal spaces are preserved verbatim:
- .
- Multiple spaces
- .
- <p>Multiple spaces</p>
- .
- <!-- END TESTS -->
- # Appendix A: A parsing strategy {-}
- ## Overview {-}
- Parsing has two phases:
- 1. In the first phase, lines of input are consumed and the block
- structure of the document---its division into paragraphs, block quotes,
- list items, and so on---is constructed. Text is assigned to these
- blocks but not parsed. Link reference definitions are parsed and a
- map of links is constructed.
- 2. In the second phase, the raw text contents of paragraphs and headers
- are parsed into sequences of Markdown inline elements (strings,
- code spans, links, emphasis, and so on), using the map of link
- references constructed in phase 1.
- ## The document tree {-}
- At each point in processing, the document is represented as a tree of
- **blocks**. The root of the tree is a `document` block. The `document`
- may have any number of other blocks as **children**. These children
- may, in turn, have other blocks as children. The last child of a block
- is normally considered **open**, meaning that subsequent lines of input
- can alter its contents. (Blocks that are not open are **closed**.)
- Here, for example, is a possible document tree, with the open blocks
- marked by arrows:
- ``` tree
- -> document
- -> block_quote
- paragraph
- "Lorem ipsum dolor\nsit amet."
- -> list (type=bullet tight=true bullet_char=-)
- list_item
- paragraph
- "Qui *quodsi iracundia*"
- -> list_item
- -> paragraph
- "aliquando id"
- ```
- ## How source lines alter the document tree {-}
- Each line that is processed has an effect on this tree. The line is
- analyzed and, depending on its contents, the document may be altered
- in one or more of the following ways:
- 1. One or more open blocks may be closed.
- 2. One or more new blocks may be created as children of the
- last open block.
- 3. Text may be added to the last (deepest) open block remaining
- on the tree.
- Once a line has been incorporated into the tree in this way,
- it can be discarded, so input can be read in a stream.
- We can see how this works by considering how the tree above is
- generated by four lines of Markdown:
- ``` markdown
- > Lorem ipsum dolor
- sit amet.
- > - Qui *quodsi iracundia*
- > - aliquando id
- ```
- At the outset, our document model is just
- ``` tree
- -> document
- ```
- The first line of our text,
- ``` markdown
- > Lorem ipsum dolor
- ```
- causes a `block_quote` block to be created as a child of our
- open `document` block, and a `paragraph` block as a child of
- the `block_quote`. Then the text is added to the last open
- block, the `paragraph`:
- ``` tree
- -> document
- -> block_quote
- -> paragraph
- "Lorem ipsum dolor"
- ```
- The next line,
- ``` markdown
- sit amet.
- ```
- is a "lazy continuation" of the open `paragraph`, so it gets added
- to the paragraph's text:
- ``` tree
- -> document
- -> block_quote
- -> paragraph
- "Lorem ipsum dolor\nsit amet."
- ```
- The third line,
- ``` markdown
- > - Qui *quodsi iracundia*
- ```
- causes the `paragraph` block to be closed, and a new `list` block
- opened as a child of the `block_quote`. A `list_item` is also
- added as a child of the `list`, and a `paragraph` as a child of
- the `list_item`. The text is then added to the new `paragraph`:
- ``` tree
- -> document
- -> block_quote
- paragraph
- "Lorem ipsum dolor\nsit amet."
- -> list (type=bullet tight=true bullet_char=-)
- -> list_item
- -> paragraph
- "Qui *quodsi iracundia*"
- ```
- The fourth line,
- ``` markdown
- > - aliquando id
- ```
- causes the `list_item` (and its child the `paragraph`) to be closed,
- and a new `list_item` opened up as child of the `list`. A `paragraph`
- is added as a child of the new `list_item`, to contain the text.
- We thus obtain the final tree:
- ``` tree
- -> document
- -> block_quote
- paragraph
- "Lorem ipsum dolor\nsit amet."
- -> list (type=bullet tight=true bullet_char=-)
- list_item
- paragraph
- "Qui *quodsi iracundia*"
- -> list_item
- -> paragraph
- "aliquando id"
- ```
- ## From block structure to the final document {-}
- Once all of the input has been parsed, all open blocks are closed.
- We then "walk the tree," visiting every node, and parse raw
- string contents of paragraphs and headers as inlines. At this
- point we have seen all the link reference definitions, so we can
- resolve reference links as we go.
- ``` tree
- document
- block_quote
- paragraph
- str "Lorem ipsum dolor"
- softbreak
- str "sit amet."
- list (type=bullet tight=true bullet_char=-)
- list_item
- paragraph
- str "Qui "
- emph
- str "quodsi iracundia"
- list_item
- paragraph
- str "aliquando id"
- ```
- Notice how the [line ending] in the first paragraph has
- been parsed as a `softbreak`, and the asterisks in the first list item
- have become an `emph`.
- The document can be rendered as HTML, or in any other format, given
- an appropriate renderer.
|