their inputs; and how to evaluate them.
Later, we will discuss these challenges
and the corresponding dimensions in
detail. Here, we discuss the remaining
three dimensions: degree of manual
effort, role of human users, and standalone versus piggyback architectures.
Degree of manual effort. When building a CS system, we must decide how
much manual effort is required to solve
each of the four CS challenges. This can
range from relatively little (for example,
combining ratings) to substantial (for
example, combining code), and clearly
also depends on how much the system
is automated. We must decide how to
divide the manual effort between the
users and the system owners. Some
systems ask the users to do relatively
little and the owners a great deal. For
example, to detect malicious users,
the users may simply click a button to
report suspicious behaviors, whereas
the owners must carefully examine all
relevant evidence to determine if a user
is indeed malicious. Some systems do
the reverse. For example, most of the
manual burden of merging Wikipedia
edits falls on the users (who are currently editing), not the owners.
Role of human users. We consider
four basic roles of humans in a CS
system. Slaves: humans help solve
the problem in a divide-and-conquer
fashion, to minimize the resources
(for example, time, effort) of the owners. Examples are ESP and finding a
missing boat in satellite images using
Mechanical Turk. Perspective providers: humans contribute different perspectives, which when combined often
produce a better solution (than with a
single human). Examples are reviewing books and aggregating user bets to
make predictions.
29 Content providers:
humans contribute self-generated content (for example, videos on YouTube,
images on Flickr). Component providers: humans function as components
in the target artifact, such as a social
network, or simply just a community
of users (so that the owner can, say, sell
ads). Humans often play multiple roles
within a single CS system (for example,
slaves, perspective providers, and content providers in Wikipedia). It is important to know these roles because
that may determine how to recruit. For
example, to use humans as perspective
providers, it is important to recruit a
Compared to the
physical world,
the Web can
dramatically
improve existing
crowdsourcing
systems and give
birth to novel
system types.
diverse crowd where each human can
make independent decisions, to avoid
“group think.”
29
Standalone versus piggyback. When
building a CS system, we may decide to
piggyback on a well-established system,
by exploiting traces that users leave in
that system to solve our target problem.
For example, Google’s “Did you mean”
and Yahoo’s Search Assist utilize the
search log and user clicks of a search
engine to correct spelling mistakes. Another system may exploit user purchases in an online bookstore (Amazon) to
recommend books. Unlike standalone
systems, such piggyback systems do not
have to solve the challenges of recruiting users and deciding what they can
do. But they still have to decide how to
evaluate users and their inputs (such
as traces in this case), and to combine
such inputs to solve the target problem.
sample Cs systems on the Web
Building on this discussion of CS dimensions, we now focus on CS systems
on the Web, first describing a set of
basic system types, and then showing
how deployed CS systems often combine multiple such types.
The accompanying table shows a
set of basic CS system types. The set is
not meant to be exhaustive; it shows
only those types that have received
most attention. From left to right, it
is organized by collaboration, architecture, the need to recruit users, and
then by the actions users can take. We
now discuss the set, starting with explicit systems.
Explicit Systems: These standalone
systems let users collaborate explicitly.
In particular, users can evaluate, share,
network, build artifacts, and execute
tasks. We discuss these systems in turn.
Evaluating: These systems let users
evaluate “items” (for example, books,
movies, Web pages, other users) using
textual comments, numeric scores, or
tags.
10
Sharing: These systems let users
share “items” such as products, services, textual knowledge, and structured
knowledge. Systems that share products and services include Napster, YouTube, CPAN, and the site programma-
bleweb.com (for sharing files, videos,
software, and mashups, respectively).
Systems that share textual knowledge
include mailing lists, Twitter, how-to