XML
Extensible Markup Language is another language used to define the syntax of markup languages. XML is a subset of SGML, and is designed to represent arbitrary structured data in a text format. SGML is the Standard Generalised Markup Language, and is an enabling technology used in publishing applications such as HTML and XML - so a working knowledge of SGML
<http://www.w3.org/MarkUp/SGML/> may be handy. XSL is a style sheet language for styling XML documents.
Some sites to examine are:
What's my line?
When I put forward my opinion, I am reminded of the public speakers that get up on their soapbox each Sunday morning in Hyde Park, London to push forward their thoughts to the gathering crowd. You are my audience! Last year I started sending soapbox e-mail messages to my colleagues amusement, as well-formed XML, using my root tag element:
XML was designed to describe data with tags. It is not predefined like HTML, so you must define your own tags (as XML is self-describing). Soon I had a set of child tags for issues, facts, examples, questions and thoughts – like a developing schema or document type definition (DTD). I found that XML helped to 'see' the relationship of each element with indenting. Maybe those Hyde Park speakers practiced XML in their speeches, popular in the 19th century, without us really knowing! ☺
Instead of reading about my perspective on this subject in 12 point, Times Roman, formatted text, the notes are now presented as 'well formed' XML, using 10 point Courier New font. Note how hard it is for the eye and brain to 'parse' the raw XML content:
<?xml version="1.0" encoding='ISO-8859-1'?>
<soapbox>
<issue>
I hold the tenet that e-commerce and indeed many e-business applications, are not about commerce at all. I suggest that an e-commerce frameworkprovides a system for secure electronic transactions of a diverse nature.
</issue>
<issue>
A bank or credit card company desires a secure transaction for moving numbers and money around the Internet. Many Internet users need the same secure system for transactions without money involved at all.
<example>
a student and teacher need a secure system to exchange
assignments and marks;
</example>
<example>
a farmer and a government agency exchange water quota
figures for irrigation, also require a private, secure
transaction without money involved.
</example>
</issue>
<issue>
In this subject, you will use object modelling as a tool for designing a secure E-business application or Web object component, such as an E-catalogue, an interactive multimedia product for sale or a Shopping Cart.
</issue>
<question>
What is the significance of the thing you do?
</question>
<thought>
For each E-business component that you build and attach to Web site – the online store front, you should discuss how your component connects or relates to your UML-based model, the Online Store (Web site); its business plan for online transactions and the secure E-commerce model in the business plan.
</thought>
<issue>
Such thinking and decision-making by you, will assist the organization (and you) to see not only how all the bits fit together, but also how your work fits into a secure, portable, scalable, reusable and persistent (big picture) development plan for the E-systems infrastructure of the business entity.
</issue>
</soapbox>
XML is widely supported as an open technology for data exchange and is better designed to handle the task of managing information. XML was designed to describe data, where XML tags are not predefined in XML. You must define your own tags, hence XML is self-describing. XML uses a DTD (Document Type Definition) to formally describe the data.
XML is not a replacement for HTML. XML and HTML were designed with different goals: XML was designed to describe data and to focus on what data is; HTML was designed to display data and to focus on how data looks. HTML is about displaying information, XML is about describing information.
will be used to format and display the same data. It is strongly believed that XML will be as important to the future of the Web as HTML has been to the foundation of the Web. XML is the future for all data transmission and data manipulation over the Web, using new protocols like SOAP.
XML is extensible and portable
The tags used to markup HTML documents and the structure of HTML documents is predefined. The author of HTML documents can only use tags that are defined in the HTML standard. However, XML allows you to define your own tags and your own document structure. Since XML tags describe the data they contain, it is possible to search, sort, manipulate and render an XML document using related technologies such as XSL (Extensible Style sheet Language).
XML parser
XML docs are highly portable – any text editor can be used. XML is both human and machine-readable. When processing XML, and XML document end with a .xml extension. An XML parser is required to check doc syntax and can support the Document Object Model (DOM) and/or the Simple API for XML (SAX).
Several parsers are available free of charge for Java, Python and other languages. DOM-based parsers build a tree structure containing the XML doc's data in memory – it allows data to be programmatically manipulated. SAX-based parsers process the docs and generate events when tags, text, comments etc. are encountered – returning data from XML doc. XML parsers are kept at http://www.xml.com/.
XML DTD
An XML doc can reference an optional Document Type Definition (DTD) file – this defines how the XML doc is structuredXML can be used as 'well formed', using the rules of XML or 'validated' by use of a DTD.
How can XML be used?
XML can keep data separated from your HTML - HTML pages are used to display data. Data is often stored inside HTML pages. With XML this data can now be stored in a separate XML file. This way you can concentrate on using HTML for formatting and display, and be sure that changes in the underlying data will not force changes to any of your HTML code.
over the Internet. Converting the data to XML can greatly reduce this complexity and create data that can be read by different types of applications. XML can be used to store data in files or in databases - XML can also be used to store data in files or in databases. Applications can be written to store and retrieve information from the store, and generic applications can be used to display the data.
XML schema design for XML data structures
Several tools and XML schema languages exist. Interesting suggestions (and hints) are provided by Kennedy (2003) and Fitzgerald, M. (2002). Kennedy (2003) recommends an XML schema tool called RelaxNG, in association with RelaxNC. The key features of RELAX NG as an XML schema development tool is that it:
· is simple
· is easy to learn
· has both an XML syntax and a compact non-XML syntax
· does not change the information set of an XML document
· supports XML namespaces
· treats attributes uniformly with elements so far as possible
· has unrestricted support for unordered content
· has unrestricted support for mixed content
· has a solid theoretical basis
· can partner with a separate data typing language (such W3C XML Schema Data types).
Kennedy (2003) also suggests using ELM tree diagrams for mapping the data and structures, in conjunction with another tool called RELAX NC - which is the human friendly form of RELAX NG. A lot of students new to XML analysis take an instance, write it up in XML and then reverse-engineer the DTD or XML schema from that instance. However, ELM tree diagrams have been found to be useful in faster and effective forward engineering of the XML schema. Here are some steps to follow:
1. Do the data analysis using an Elm tree diagram for data and structure - see http://www.w3.org/XML/1998/06/xmlspec-report.htm#AEN101
2. Write the XML schema using RNC/RNG e.g. use a recipe.
3. Write up the document in XML.
Seven (7) rules for XML analysis
1. Code only the data
2. Use collection elements
3. Use groups of 'like' elements
4. Use elements to represent multi-valued data
5. Avoid mixed content
6. Use meaningful names and a standard case
7. Note that it is an iterative process.
Further tips
· With the XML tree structure, avoid 'tall trees' or 'wide hedges'
· Elements represent the data
· Attributes used for element meta-data
· Do not use attributes for content
· Mark-up documents e.g. user manual
· Mark-up documents for data transfer e.g. invoices.
The steps of: (1) mark-up XML; (2) create DTD (reverse engineer); (3) write XSL; are replaced by steps involving (1) Elm tree; (2) RelaxNC; (3) Mark-up XML; (4) write XSL.
Information integration as the problem: Is XML the solution?
While companies starting with a 'blank piece of paper' are at an advantage because they can store data in XML from scratch (and this is of increasing importance), most existing data repositories are still managed by relational database systems. Recent XML database research has focused on extending relational database backend to handle XML data efficiently. Special algorithms need to be designed and implemented on top of existing relational databases to support efficient database queries. For large systems, the update performance of the database in addition to query execution time will be scrutinised.
Exercise 8: XML introduction
Create an XML document for an online catalogue of cars where each car has the child elements of make, model, year, colour, engine, number_of_doors, transmission_type and accessories. The engine has child elements called number_of_cylinders and fuel_system.
SOAP and XML messaging
Simple Object Access Protocol (SOAP) is a specification developed by an industry consortium, including Microsoft and IBM - and given to apache.org as an open source standard. SOAP is implemented by many vendors including IBM, Sun and Microsoft following the same standards, which can be extended to produce specific XML messaging standards such as ebXML and Biztalk.
The specification uses XML to define the SOAP envelope as well as RPC data structures (XML schemas). This has seen SOAP and not CORBA, DCOM or RMI rise as the challenger in distributed objects architecture. While XML is seen as human readable, machine and language neutral, for some time, distributed objects systems have not been met with great enthusiasm, as they are not suited to web browser access and were difficult to implement, or in the case of CORBA binary objects, unable to penetrate firewalls. SOAP runs over HTTP or SMTP and therefore works with the firewall setup. SOAP was attractive as an open standard and has an easy to follow specification, in addition to being a lightweight (as opposed to CORBA's heavy reliance on support code). The lightweight challenger vs the heavyweight champ!
The next page explains, with the use of diagrams in Figures 5.1 & 5, how SOAP works. Vendors produce API's that handle the SOAP services for us, so we just need to learn the API to make SOAP systems work, such as Java's Web Services Development Pack, Apache SOAP 2.2 and IBM's Web Services Development Pack.
Dynamic discovery of web services (UDDI) and describing them (WSDL)
We can define a Web Service with Web Services Definition Language (WSDL). WSDL is an XML language used to describe what SOAP systems offer and is similar in concept to writing interfaces with IDL - as the WSDL file can be stored in the registry and discovered by clients, and help with workflow automation. In Topic 6 you can learn more about RMI and CORBA, which SOAP complements and improves. WSDL is used to mark-up, advertise and automate Web services for client businesses using UDDI in Topic 7. The Universal Description Discovery Integration (UDDI) – is seen as the phonebook-style register for e- commerce, where small businesses can describe themselves, discover other businesses and look at integration of services.
It is here that we conclude Topic 5 with firm evidence of XML as a core technology for e-commerce and business applications development of Web services – storing data, and transporting data and messages to objects. Topic 5 also gives an introduction to SOAP and its practical uses in accessing resources beyond the firewall, such as Database servers, CORBA, RMI and EJB systems, as well as in exposing a web application as a stand-alone application in Web services.
SOAP was good for developing lightweight distributed component applications - by wrapping existing applications, and in automated business processes.
No wonder SOAP is the lightweight challenger for the distributed objects crown - and a good way to introduce the other main distributed objects systems – RMI and CORBA, coming up next in Topic 6.
0 comments:
Post a Comment