本说明受到 Anna Hester 的 HTMLToolkit 和 Roberto Ierusalimschy 的技术说明 9(关于连接字符串的低效率)的启发。语法和例如 HTML 文档结构的基础数学抽象是树。我们在此提供一个名为 weave 的简短 Lua 程序,用于根据描述构建树并将其写出。例如,此文档是通过适用于 HTML 的 weave 构建的。HTML 的一大缺点是它既没有变量也没有抽象功能。我现在用 weave 制作我的所有网页。此网页的 weave 源代码显示在下面。 所谓“树”,我们指的是一个表,其索引为整数,其值为字符串(树的叶)或树。以下是构造树的函数“node”。它有两种方法,“push”和“walk”。Push 插入一个新字符串或树。Walk 将函数递归应用于叶。 node = function (list) local a if list then a = list a.n = getn(list) else a = { n = 0 } end -- if if not a.push then a.push = function (self,x) tinsert(self,x) end end -- if if not a.walk then a.walk = function (self,f) local b for i = 1,self.n do b = self[i] if type(b) == "string" then f(b) else b:walk(f) end -- if end -- for end -- function end -- if return a end -- functionThis is the core of weave. Then some specific definitions are required to interpret the structure in question, in this case HTML. It makes sense to modularise weave by reading in these definitions using dofile. The final part of weave writes out the leaves to its output, 'out', which I am presuming is specified by the document being processed (which is given by 'arg[1]'). Here it is: page,out = PAGE(dofile(arg[1])) -- read the page description assert(page,"Cannot read "..arg[1]) writeto(out) page:walk(write) -- write the HTML file writeto()It remains to describe the HTML-specific part of weave, and, in particular, the function PAGE. This function takes three arguments, the title, the pathname of the file that the document is to reside in, and the tree describing the body of the document. It outputs the whole tree and the pathname. PAGE = function (title, saveas, body) local x = node { [[<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">]], HTML { HEAD { [[<meta name="MSSmartTagsPreventParsing" content="TRUE">]], TITLE { title } }, -- HEAD BODY(body) } -- HTML } -- node return x,saveas end -- functionWe have used the convention of using upper case versions of HTML tag names for constructors (i.e. nodes of the tree) to contrast with lower case variable names for the user to choose for their arguments, much in the style of Haskell. These are defined by: tag = function (name,attr) local f = function (obj) local n,att,x = %name,%attr,node() x:push "<" x:push(n) if att then x:push " " x:push(att) end -- if x:push ">" x:push(node(obj)) x:push "</" x:push(n) x:push ">\n" return x end -- function return f end -- function monotags = { "p","b","hr" } for _,s in monotags do setglobal(strupper(s),"\n<"..s..">\n") end -- for tagwords = { "html","head","body","title","center", "h1","h2","h3","h4","h5","h6","tt","b", "ul","ol","li","dl","dd","dt","tr","td", "th","pre","small"} for _,s in tagwords do setglobal(strupper(s),tag(s)) end -- forYou can probably now write your own definitions for anchors, images and other bits of HTML. A useful non-HTML addition is the facility to INCLUDE another file as a piece of text in the document. INCLUDE = function (fname) readfrom(fname) local text = read("*a") readfrom() return text end -- functionBy way of example, if you want to read this whole document all over again, but in the form of weave source code, here it is. -- Technote source for weave -- GCW August 2001 -- Please set value of filename for your own computer filename = "ltn010.html" -- Please adjust XXX yourself title = "Technical Note 10 - Weave your own description languages" this = "article" -- filename of this file. Please adjust if necessary. subtitle = "Web page construction as an example" author = "G.C.Wraith" mailto = "mailto:gavin@wraith.u-net.com" code = function (x) return B(TT(PRE {x})) end intro = [[This note was inspired by Anna Hester's HTMLToolkit and Roberto Ierusalimschy's Technical Note 9 on the inefficiencies of concatenating strings. The mathematical abstraction that underlies grammars and the structure of, for example HTML documents, is that of a tree. We present here a short Lua program, called weave, for building trees out of descriptions and writing them out. This document, for example, is constructed by weave adapted to HTML. The big drawback of HTML is that it has neither variables nor abstraction facilities. I make all my web pages with weave now.]] para1 = [[By a 'tree' we mean a table, whose indices are integers, and whose values are either strings (leaves of the tree) or trees. Here is the function 'node' which constructs trees. It has two methods, 'push' and 'walk'. Push inserts a new string or tree. Walk applies a function recursively to the leaves.]] node_def = [[ node = function (list) local a if list then a = list a.n = getn(list) else a = { n = 0 } end -- if if not a.push then a.push = function (self,x) tinsert(self,x) end end -- if if not a.walk then a.walk = function (self,f) local b for i = 1,self.n do b = self[i] if type(b) == "string" then f(b) else b:walk(f) end -- if end -- for end -- function end -- if return a end -- function ]] para2 = [[This is the core of weave. Then some specific definitions are required to interpret the structure in question, in this case HTML. It makes sense to modularise weave by reading in these definitions using dofile. The final part of weave writes out the leaves to its output, 'out', which I am presuming is specified by the document being processed (which is given by 'arg[1]'). Here it is:]] out_def = [[ page,out = PAGE(dofile(arg[1])) -- read the page description assert(page,"Cannot read "..arg[1]) writeto(out) page:walk(write) -- write the HTML file writeto() ]] para3 = [[It remains to describe the HTML-specific part of weave, and, in particular, the function PAGE. This function takes three arguments, the title, the pathname of the file that the document is to reside in, and the tree describing the body of the document. It outputs the whole tree and the pathname.]] PAGE_def = [[ PAGE = function (title, saveas, body) local x = node { [[<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">]], HTML { HEAD { [[<meta name="MSSmartTagsPreventParsing" content="TRUE">]], TITLE { title } }, -- HEAD BODY(body) } -- HTML } -- node return x,saveas end -- function ]] para4 = [[We have used the convention of using upper case versions of HTML tag names for constructors (i.e. nodes of the tree) to contrast with lower case variable names for the user to choose for their arguments, much in the style of Haskell. These are defined by:]] tag_def = [[ tag = function (name,attr) local f = function (obj) local n,att,x = %name,%attr,node() x:push "<" x:push(n) if att then x:push " " x:push(att) end -- if x:push ">" x:push(node(obj)) x:push "</" x:push(n) x:push ">\n" return x end -- function return f end -- function monotags = { "p","b","hr" } for _,s in monotags do setglobal(strupper(s),"\n<"..s..">\n") end -- for tagwords = { "html","head","body","title","center", "h1","h2","h3","h4","h5","h6","tt","b", "ul","ol","li","dl","dd","dt","tr","td", "th","pre","small"} for _,s in tagwords do setglobal(strupper(s),tag(s)) end -- for ]] para5 = [[You can probably now write your own definitions for anchors, images and other bits of HTML. A useful non-HTML addition is the facility to INCLUDE another file as a piece of text in the document.]] INCLUDE_def = [[ INCLUDE = function (fname) readfrom(fname) local text = read("*a") readfrom() return text end -- function ]] para6 = [[By way of example, if you want to read this whole document all over again, but in the form of weave source code, here it is.]] margin = function (x) return TABLE(TR(TD(x))) end get_source = " The weave source of this web page is shown below." body = { CENTER { H1 {title}, H3 {subtitle}, H6 {author} }, -- CENTER margin { P, intro, A("#source") {get_source}, P, para1, code(node_def), para2, code(out_def), para3, code(PAGE_def), para4, code(tag_def), para5, code(INCLUDE_def), para6, LABEL("source") {P}, code(INCLUDE(this)) -- self reference ! }, -- margin HR, SMALL { "by", A(mailto) {"gcw"} } } -- body return title, filename, body -- end weave source -------- |