Here, BIrefers to the product of
acquisition, interpretation, collation, assessment, and exploitation
of information in the business
domain [ 4]. CBizPort includes two
versions of its user interface—one
for simplified Chinese, one for traditional Chinese—each with the
same look and feel. Relying on a Virtual Arabic keyboard
conversion dictionary with 6,737
Chinese characters in each of the
two encodings (Big5 and GB2312),
the encoding converter converted
all Chinese characters into the
encoding of the interface version.
The eight information sources used
in the portal’s meta-searching are
major Chinese search engines and Click to summarize in
business-related portals from the 3 or 5 sentences.
three regions. Relying on two Chi-nese-phrase lexicons to extract
phrases, the portal’s categorizer
organizes retrieved Web pages into
various folders labeled with the key
phrases in the page summaries and
titles.
The Spanish Web portal (
SBizPort) supports searching and browsing of business information from
22 Spanish-speaking regions. In
addition to keyword searching,
summarization, and categorization,
like those in CBizPort, SBizPort
provides a comprehensive collection of business Web pages for
searching and supports visualization of retrieved pages (see Figure
2). Users visualize Web pages by
clicking on a region to see a list of
pages on the right and open pages by clicking the
link-embedded titles.
The Arabic Web portal, AMedPort (see Figure 3)
focuses on the medical domain in some 22 Arab
regions and supports all search and browse functions
available in SBizPort. AMedPort includes a customized user interface with a right-to-left text display
and a virtual keyboard to facilitate Arabic input.
Retrieved pages titles and
abstracts are listed.
Figure 3.
Screenshots from
AMedPort.
EXPERIMENTAL FINDINGS
Sixty native speakers participated in the experiments
(detailed in [ 3, 6]) of the three Web portals to evaluate the framework’s usability in supporting Web
searching in a multilingual world (see Table 2). In
each experiment (about an hour), a Web portal was
The user types “excretion” to
search for medical information
about the topic in Arab
regions.
Retrieved pages are
categorized into folders
labeled with key phrases.
The summary is listed on
the left, while he original
page is on the right.
The SOM visualizer categories
30 Web pages onto six regions
and displays hyperlinks on the
right.
compared with a benchmark search engine in each
of the three languages. Each subject was introduced
to the portal and the benchmark search engine and
was randomly assigned different task scenarios (one
scenario per system). Each scenario contained three
or four search and browse tasks based on standards
developed through the National Institute of Standards and Technology Text Retrieval Conference
( trec.nist.gov). All questionnaires used in the experiment were administered in the subjects’ native language. Subjects spent an average of three minutes to
finish a search task and eight minutes to finish a
browse task. The order in which the systems were
used was randomly assigned to avoid bias due to the
sequence of use.
Several information-retrieval measures—precision,
recall, and F value—revealed the search and browse
effectiveness of the system by computing the ratios
between the number of relevant results found by a