MOBY::CommonSubs.pm - a set of exportable subroutines that are useful in clients and services to deal with the input/output from MOBY Services
CommonSubs are used to do various manipulations of MOBY service-invocation and response messages. It is useful both Client and Service side and will ensure that the message structure is valid as per the latest version of the MOBY API.
It DOES NOT connect to MOBY Central for any of its functions, though it does contact the ontology server for some of its functions, so it may require a network connection.
NOTE THAT MOST LEGACY SUBROUTINES IN THIS MODULE HAVE BEEN DEPRECATED The code is still here for legacy reasons only, but the documentation has been removed permanently, so if there is a routine that you use that is no longer documented, consider yourself on dangerous ground. THIS CODE WILL NOT FUNCTION WITH SERVICES THAT ARE NOT COMPLIANT WITH THE NEW API - FOR EXAMPLE SERVICES THAT ARE NOT USING NAMED INPUTS AND OUTPUTS!
The following is a generalized architecture for all BioMOBY services showing how to parse response messages using the subroutines provided in CommonSubs
my $resp = $SI->execute(XMLInputList => \@input_data);
my $responses = serviceResponseParser($resp); # returns MOBY objects foreach my $queryID(keys %$responses){ $this_invocation = $responses->{$queryID}; # this is the <mobyData> block with this queryID my $this_output = "";
if (my $data = $this_invocation->{'responseArticleName'}){ # whatever your articleName is... # $data is a MOBY::Client::Simple|Collection|ParameterArticle my ($namespace) = @{$data->namespaces}; my $id = $data->id; my $XML_LibXML = $data->XML_DOM; # get access to the DOM # assuming that you have an element of type "String" # with articleName "Description" my $desc = getNodeContentWithArticle($XML_LibXML, "String", "Description"); ################### # DO SOMETHING TO RESPOSE DATA HERE ################### }
}
my $resp = $SI->execute(XMLInputList => \@input_data);
my $responses = serviceResponseParser($resp); # returns MOBY objects foreach my $queryID(keys %$responses){ # $inputs is a hashref of $input{queryid}->{articlename} = input object my $this_invocation = $responses->{$queryID}; if (my $data = $this_invocation->{'responseArticleName'}){ # $input is a MOBY::Client::Simple|Collection|Parameter object my $simples = $data->Simples; foreach my $simple(@$simples){ my ($ns) = @{$simple->namespaces}; my $id = $simple->id; my $XML_LibXML = $input->XML_DOM; # get access to the DOM
} } }
The following is a generalized architecture for *all* BioMOBY services showing how to parse incoming messages using the subroutines provided in CommonSubs
sub _generic_service_name { my ($caller, $data) = @_;
my $MOBY_RESPONSE;
my $inputs = serviceInputParser($data); # returns MOBY objects return SOAP::Data->type('base64' => responseHeader("illuminae.com") . responseFooter()) unless (keys %$inputs); foreach my $queryID(keys %$inputs){ $this_invocation = $inputs->{$queryID}; # this is the <mobyData> block with this queryID my $this_output = "";
if (my $input = $this_invocation->{incomingRequest}){ my ($namespace) = @{$input->namespaces}; my $id = $input->id; my $XML_LibXML = $input->XML_DOM; # get access to the DOM
################### # DO YOUR BUSINESS LOGIC HERE ###################
$this_output = "<moby:Object... rest of the output XML .../>"; }
$MOBY_RESPONSE .= simpleResponse( $this_output , "myArticleName" # the article name of that output object , $queryID); # the queryID of the input that we are responding to } return SOAP::Data->type('base64' => (responseHeader("illuminae.com") . $MOBY_RESPONSE . responseFooter)); }
sub _generic_service_returning_collections { my($caller, $message) = @_; my $inputs = serviceInputParser($message); my $MOBY_RESPONSE = ``''; # set empty response
# return empty SOAP envelope if ther is no moby input
return SOAP::Data->type('base64' => responseHeader().responseFooter()) unless (keys %$inputs);
foreach my $queryID(keys %$inputs){ # $inputs is a hashref of $input{queryid}->{articlename} = input object my $this_invocation = $inputs->{$queryID}; my @outputs; if (my $input = $this_invocation->{incomingArticleName}){ # $input is a MOBY::Client::Simple|Collection|Parameter object my $id = $input->id; my @agis = &_getMyOutputList($id); # this subroutine contains your business logic and returns a list of ids foreach (@agis){ push @outputs, "<Object namespace='MyNamespace' id='$_'/>"; } } $MOBY_RESPONSE .= collectionResponse (\@outputs, "myOutputArticleName", $queryID); } return SOAP::Data->type('base64' => (responseHeader("my.authority.org") . $MOBY_RESPONSE . responseFooter)); }
A COMPLETE EXAMPLE OF AN EASY MOBY SERVICE
This is a service that:
CONSUMES: base Object in the GO namespace EXECUTES: Retrieval PRODUCES: GO_Term (in the GO namespace)
# this subroutine is called from your dispatch_with line # in your SOAP daemon
sub getGoTerm {
use MOBY::CommonSubs qw{:all}; my ($caller, $incoming_message) = @_; my $MOBY_RESPONSE; # holds the response raw XML my @validNS = validateNamespaces("GO"); # do this if you intend to be namespace aware!
my $dbh = _connectToGoDatabase(); # connect to some database return SOAP::Data->type('base64' => responseHeader('my.authURI.com') . responseFooter()) unless $dbh; my $sth = $dbh->prepare(q{ # prepare your query select name, term_definition from term, term_definition where term.id = term_definition.term_id and acc=?});
my $inputs= serviceInputParser($incoming_message); # get incoming invocations # or fail properly with an empty response if there is no input return SOAP::Data->type('base64' => responseHeader("my.authURI.com") . responseFooter()) unless (keys %$inputs);
foreach my $queryID(keys %$inputs){ my $this_invocation = $inputs->{$queryID}; # this is the <mobyData> block with this queryID my $invocation_output=""; # prepare a variable to hold the output XML from this invocation
if (my $input = $this_invocation->{GO_id}){ # the articleName of your services input my ($namespace) = @{$input->namespaces}; # this is returned as a list! my $id = $input->id;
# optional - if we want to ENSURE that the incoming ID is in the GO namespace # we can validate it using the validateThisNamespace routine of CommonSubs # @validNS comes from validateNamespaces routine of CommonSubs (called above) next unless validateThisNamespace($namespace, @validNS);
# here's our business logic... $sth->execute($id); my ($term, $def) = $sth->fetchrow_array; if ($term){ $invocation_output = "<moby:GO_Term namespace='GO' id='$id'> <moby:String namespace='' id='' articleName='Term'>$term</moby:String> <moby:String namespace='' id='' articleName='Definition'>$def</moby:String> </moby:GO_Term>"; } }
# was our service execution successful? # if so, then build an output message # if not, build an empty output message $MOBY_RESPONSE .= simpleResponse( # simpleResponse is exported from CommonSubs $invocation_output # response for this query , "A_GoTerm" # the article name of that output object , $queryID); # the queryID of the input that we are responding to } # now return the result return SOAP::Data->type('base64' => (responseHeader("illuminae.com") . $MOBY_RESPONSE . responseFooter)); }
function: These routines will take a Moby invocation (server side usage) or response (client-side usage) and extract the Simple/Collection/Parameter objects out of it as MOBY::Client::SimpleArticle, MOBY::Client::CollectionArticle, and/or MOBY::Client::SecondaryArticle objects. The inputs are broken up into individual queryID's. Each queryID is associated with one or more individual articles, and each article is available by its articleName.
usage: my $inputs = serviceInputParser($MOBY_invocation_mssage));
usage: my $outputs = serviceResponseParser($MOBY_response_message));
args: $message
- this is the SOAP payload; i.e. the XML document containing the MOBY message
returns: $inputs
or $outputs
are a hashref with the following structure:
$Xputs->{$queryID}->{articleName} = MOBY::Client::SimpleArticle | MOBY::Client::CollectionArticle | MOBY::Client::SecondaryArticle
the SimpleArticle and CollectionArticle have methods to provide you with their objectType, their namespace and their ID. If you want to get more out of them, you should retrieve the XML::LibXML DOM using the ->XML_DOM method call in either of those objects. This can be passed into other CommonSubs routines such as getNodeContentWithArticle in order to retrieve sub-components of the Moby object you have in-hand.
See also:
For example, the input message:
<mobyData queryID = 'a1a'> <Simple articleName='name1'> <Object namespace=blah id=blah/> </Simple> <Parameter articleName='cutoff'> <Value>10</Value> </Parameter> </mobyData>
will become:
$inputs->{a1a}->{name1} = $MOBY::Client::Simple, # the <Simple> block $inputs->{a1a}->{cutoff} = $MOBY::Client::Secondary # <Parameter> block
With inputs that have collections these are presented as a listref of Simple article DOM's. So for the following message:
<mobyData queryID = '2b2'> <Collection articleName='name1'> <Simple> <Object namespace=blah id=blah/> </Simple> <Simple> <Object namespace=blah id=blah/> </Simple> </Collection> <Parameter articleName='cutoff'> <Value>10</Value> </Parameter> </mobyData>
will become
$inputs->{2b2}->{name1} = $MOBY::Client::Collection, # the <Collection> Block $inputs->{2b2}->{cutoff} = $MOBY::Client::Secondary, # the <Parameter> Block
This section describes functionality associated with identifying parts of a message, and checking that it is valid.
function: tests XML (text) or an XML DOM node to see if it represents a Simple, Collection, or Secondary article
These routines are unlikely to be useful in a MOBY Object oriented service but they have been retained for legacy reasons.
usage:
if (isSimpleArticle($node)){do something to it}
or
if (isCollectionArticle($node)){do something to it}
or
if (isSecondaryArticle($node)){do something to it}
input : an XML::LibXML node, an XML::LibXML::Document or straight XML
returns: boolean
function: checks the namespace ontology for the namespace lsid
usage: @LSIDs = validateNamespaces(@namespaces)
args: ordered list of either human-readable or lsid presumptive namespaces
returns: ordered list of the LSID's corresponding to those presumptive namespaces; undef for each namespace that was invalid
function: checks a given namespace against a list of valid namespaces
usage: $valid = validateThisNamespace($ns, @validNS);
args: ordered list of the namespace of interest and the list of valid NS's
returns: boolean
This section describes how to construct output, in response to an incoming message. Responses come in three varieties:
article(s)
collection(s)
of simples
function: wraps a simple article in the appropriate (mobyData) structure. Works only for simple articles. If you need to mix simples with collections and/or secondaries use complexReponse instead.
usage: $responseBody = simpleResponse($object, $ArticleName, $queryID);
args: (in order)
$object
- (optional) a MOBY Object as raw XML.
$articleName
- (optional) an article name for this article.
$queryID
- (optional, but strongly recommended) the query ID value for the mobyData block to which you are responding.
notes: As required by the API you must return a response for every input. If one of the inputs was invalid, you return a valid (empty) MOBY response by calling simpleResponse(undef, undef, $queryID) with no arguments.
function: wraps a set of articles in the appropriate mobyData structure. Works only for collection articles. If you need to mix collections with simples and/or secondaries use complexReponse instead.
usage: $responseBody = collectionResponse(\@objects, $articleName, $queryID);
args: (in order)
\@objects
- (optional) a listref of MOBY Objects as raw XML.
$articleName
- (optional) an artice name for this article.
$queryID
- (optional, but strongly recommended) the ID of the query to which you are responding.
notes: as required by the API you must return a response for every input. If one of the inputs was invalid, you return a valid (empty) MOBY response by calling collectionResponse(undef, undef, $queryID).
function: wraps articles in the appropriate (mobyData) structure. Can be used to send any combination of the three BioMOBY article types - simple, collection and secondary - back to a client.
usage: $responseBody = complexResponse(\@articles, $queryID);
args: (in order)
\@articles
- (optional) a listref of arrays. Each element of @articles is
itself a listref of [$articleName, $article], where $article is either
the article's raw XML for simples and secondaries or a reference to an array containing
[$articleName, $simpleXML] elements for a collection of simples.
$queryID
- the queryID value for
the mobyData block to which you are responding
notes: as required by the API you must return a response for every input. If one of the inputs was invalid, you return a valid (empty) MOBY response by calling complexResponse(undef, $queryID) with no arguments.
function: print the XML string of a MOBY response header +/- serviceNotes +/- Exceptions
usage:
responseHeader('illuminae.com')
responseHeader( -authority => 'illuminae.com', -note => 'here is some data from the service provider' -exception=>'an xml encoded exception string')
args: a string representing the service providers authority URI, OR a set of named arguments with the authority and the service provision notes which can include already xml encoded exceptions
caveat :
notes: returns everything required up to the response articles themselves. i.e. something like:
<?xml version='1.0' encoding='UTF-8'?> <moby:MOBY xmlns:moby='http://www.biomoby.org/moby'> <moby:Response moby:authority='http://www.illuminae.com'>
function: wraps a Biomoby Exception with all its parameters into the appropiate MobyData structure
usage:
encodeException( -refElement => 'refers to the queryID of the offending input mobyData', -refQueryID => 'refers to the articleName of the offending input Simple or Collection' -severity=>'error' -exceptionCode=>'An error code ' -exceptionMessage=>'a human readable description for the error code')
args:the different arguments required by the mobyException API severity can be either error, warning or information valid error codes are decribed on the biomoby website
notes: returns everything required to use for the responseHeader:
<moby:mobyException moby:refElement='input1' moby:refQueryID='1' moby:severity =''> <moby:exceptionCode>600</moby:exceptionCode> <moby:exceptionMessage>Unable to execute the service</moby:exceptionMessage> </moby:mobyException>
function: print the XML string of a MOBY response footer
usage:
return responseHeader('illuminae.com') . $DATA . responseFooter;
notes: returns everything required after the response articles themselves i.e. something like:
</moby:Response> </moby:MOBY>
This section contains subroutines that handle processing of optional message elements containing meta-data. Examples are the ServiceNotes, and CrossReference blocks.
function: to get the content of the Service Notes block of the MOBY message
usage: getServiceNotes($message)
args: $message
is either the XML::LibXML of the MOBY message, or plain XML
returns: String content of the ServiceNotes block of the MOBY Message
function: to get the content of the Exception part of the Service Notes block of the MOBY message
usage: getExceptions($message)
args: $message
is either the XML::LibXML of the MOBY message, or plain XML
returns: an Array of Hashes containing the exception Elemtents and attributes
example: my @ex=getExceptions($XML); foreach my $exception (@ex) { print ``Reference:'',$exception->{refElement},``\n''; print ``Query ID:'',$exception->{refQueryID},``\n''; print ``Severity of message:'',$exception->{severity},``\n''; print ``Readable message:'',$exception->{exceptionMessage},``\n''; print ``Exception Code:'',$exception->{exceptionCode},``\n''; }
function: to get the cross-references for a Simple article. This is primarily a Client-side function, since service providers should not, usually, be interpreting Cross-references.
usage: @xrefs = getCrossReferences($XML)
args: $XML
is either a SIMPLE article (<Simple>...</Simple>) or an
object (the payload of a Simple article), and may be either raw XML or
an XML::LibXML node.
returns: an array of MOBY::CrossReference objects
example:
my (($colls, $simps) = getResponseArticles($query); # returns DOM nodes foreach (@{$simps}){ my @xrefs = getCrossReferences($_); foreach my $xref(@xrefs){ print "Cross-ref type: ",$xref->type,"\n"; print "namespace: ",$xref->namespace,"\n"; print "id: ",$xref->id,"\n"; if ($xref->type eq "Xref"){ print "Cross-ref relationship: ", $xref->xref_type,"\n"; } } }
This section contains routines that didn't quite seem to fit anywhere else.
function: give me a DOM, a TagName, an articleName and I will return you the content of that node **as a string** (beware if there are additional XML tags in there!) this is meant for MOBYesque PRIMITIVES - things like: <String articleName=``SequenceString''>TAGCTGATCGAGCTGATGCTGA </String> call _getNodeContentsWithAttribute($DOM_NODE, ``String'', ``SequenceString'') and I will return ``TACGATGCTAGCTAGCGATCGG'' Caveat Emptor - I will NOT chop off leading and trailing whitespace or carriage returns, as these might be meaningful!
usage: @content = getNodeContentWithArticle($XML_DOM, $elementname, $articleName)
args: $XML_DOM is the DOM of a MOBY Object $elementName is the Tag of an XML element $articleName is the articleName attribute of the desired XML element
returns: an array of strings representing every line of every match (you probably want to concatenate these with a ``join''... probably...
example: given <SomeObject namespace='' id=''> <String namespace='' id='' articleName=``SequenceString''>TAGCTGATCGAGCTGATGCTGA </String> </SomeObject>
my $seq = getNodeContentWithArticle($DOM, "String", "SequenceString"); print "yeah!" if $seq eq "TAGCTGATCGAGCTGATGCTGA";
function: select the parent node from nodeList that is closest to the querynode
usage:
($term, $lsid) = whichDeepestParentObject($CENTRAL, $queryTerm, \@termList)
args:
$CENTRAL
- your MOBY::Client::Central object
$queryTerm
- the object type I am interested in
\@termlist
- the list of object types that I know about
returns: an ontology term and LSID as a scalar, or undef if there is no parent of this node in the nodelist. note that it will only return the term if you give it term names in the @termList. If you give it LSID's in the termList, then both the parameters returned will be LSID's - it doesn't back-translate...)
usage: $object->_rearrange( array_ref, list_of_arguments)
Purpose : Rearranges named parameters to requested order.
Example:
$self->_rearrange([qw(SEQUENCE ID DESC)],@param);
Where @param = (-sequence =
$s, -desc => $d, -id => $i);>
returns: @params
- an array of parameters in the requested order.
The above example would return ($s, $i, $d).
Unspecified parameters will return undef. For example, if
@param = (-sequence =
$s);>
the above _rearrange call would return ($s, undef, undef)
Argument: $order
: a reference to an array which describes the desired order of the named parameters.
@param :
an array of parameters, either as a list (in which case the function
simply returns the list), or as an associative array with hyphenated
tags (in which case the function sorts the values according to
@{$order} and returns that new array.) The tags can be upper, lower,
or mixed case but they must start with a hyphen (at least the first
one should be hyphenated.)
Source: This function was taken from CGI.pm, written by Dr. Lincoln Stein, and adapted for use in Bio::Seq by Richard Resnick and then adapted for use in Bio::Root::Object.pm by Steve Chervitz, then migrated into Bio::Root::RootI.pm by Ewan Birney.
Comments: Uppercase tags are the norm, (SAC) This method may not be appropriate for method calls that are within in an inner loop if efficiency is a concern.
Parameters can be specified using any of these formats: @param = (-name=>'me', -color=>'blue'); @param = (-NAME=>'me', -COLOR=>'blue'); @param = (-Name=>'me', -Color=>'blue'); @param = ('me', 'blue');
A leading hyphenated argument is used by this function to indicate that named parameters are being used. Therefore, the ('me', 'blue') list will be returned as-is.
Note that Perl will confuse unquoted, hyphenated tags as function
calls if there is a function of the same name in the current
namespace: -name =
'foo'> is interpreted as -&name =
'foo'>
For ultimate safety, put single quotes around the tag: ('-name'=
'me', '-color' =>'blue');>
This can be a bit cumbersome and I find not as readable as using all
uppercase, which is also fairly safe:(-NAME=
'me', -COLOR =>'blue');>
Personal note (SAC): I have found all uppercase tags to be more managable: it involves less single-quoting, the key names stand out better, and there are no method naming conflicts. The drawbacks are that it's not as easy to type as lowercase, and lots of uppercase can be hard to read. Regardless of the style, it greatly helps to line the parameters up vertically for long/complex lists.
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
DEPRECATED
Mark Wilkinson (markw at illuminae dot com), Pieter Neerincx, Frank Gibbons
BioMOBY Project: http://www.biomoby.org