Joho the Blog » liveblog

June 2, 2013

[2b2k] Knowledge in its natural state

I gave a 20 minute talk at the Wired Next Fest in Milan on June 1, 2013. Because I needed to keep the talk to its allotted time and because it was being simultaneously translated into Italian, I wrote it out and gave a copy to the translators. Inevitably, I veered from the script a bit, but not all that much. What follows is the script with the veerings that I can remember. The paragraph breaks track to the slide changes

(I began by thanking the festival, and my progressive Italian publisher, Codice Edizioni Codice are pragmatic idealists and have been fantastic to work with.)

Knowledge seems to fit so perfectly into books. But to marvel at how well Knowledge fits into books…

… is to marvel at how well each rock fits into its hole in the ground. Knowledge fits books because we’ve shaped knowledge around books and paper.

And knowledge has taken on the properties of books and paper. Like books, knowledge is ordered and orderly. It is bounded, just as books stretch from cover to cover. It is the product of an individual mind that then is filtered. It is kept private and we’re not responsible for it until it’s published. Once published, it cannot be undone. It creates a privileged class of experts, like the privileged books that are chosen to be published and then chosen to be in a library

Released from the bounds of paper, knowledge takes on the shape of its new medium, the Internet. It takes on the properties of its new medium just it had taken on the properties of its old paper medium. It’s my argument today that networked knowledge assumes a more natural shape. Here are some of the properties of new, networked knowledge

1. First, because it’s a network, it’s linked.

2. These links have no natural stopping point for your travels. If anything, the network gives you temptations to continue, not stopping points.

3. And, like the Net, it’s too big for any one head, Michael Nielsen, the author of Reinventing Discovery, uses the discovery of the Higgs Boson as an example. That discovery required gigantic networks of equipment and vast networks of people. There is no one person who understands everything about the system that proved that that particle exists. That knowledge lives in the system, in the network.

4. Like the net, networked knowledge is in perpetual disagreement. There is nothing about which everyone agrees. We like to believe this is a temporary state, but after thousands of years of recorded history, we can now see for sure that we are never going to agree about anything. The hope for networked knoweldge is that we’re learning to disagree more fruitfully, in a linked environment

5. And, as the Internet makes very clear, we are fallible creatures. We get everything wrong. So, networked knowledge becomes more credible when it acknowledges fallibility. This is very different from the old paper based authorities who saw fallibility as a challenge to their authority.

6. Finally, knowledge is taking on the humor of the Internet. We’re on the Internet voluntarily and freed of the constrictions of paper, it turns out that we like being with one another. Even when the topic is serious like this topic at Reddit [a discussion of a physics headline], within a few comments, we’re making jokes. And then going back to the serious topic. Paper squeezed the humor out of knowledge. But that’s unnatural.

These properties of networked knowledge are also properties of the Network. But they’re also properties that are more human and more natural than the properties of traditional knowledge.

But there’s one problem:

There is no such thing as natural knowledge. Knowledge is a construct. Our medium may have changed, but we haven’t, at least so it seems. And so we’re not free to reinvent knowledge any way we’d like. Significant problems based on human tendencies are emerging. I’ll point to four quick problem areas.

First, We see the old patterns of concentration of power reemerge on the Net. Some sites have an enormous number of viewers, but the vast majority of sites have very few. [Slide shows Clay Shirky’s Power Law distribution chart, and a photo of Clay]

Albert-László Barabási has shown that this type of clustering is typical of networks even in nature, and it is certainly true of the Internet

Second, on the Internet, without paper to anchor it, knowledge often loses its context. A tweet…

Slips free into the wild…

It gets retweeted and perhaps loses its author

And then gets retweeted and lose its meaning. And now it circulates as fact. [My example was a tweet about the government not allowing us to sell body parts morphing into a tweet about the government selling body parts. I made it up.]

Third, the Internet provides an incentive to overstate.

Fourth, even though the Net contains lots of different sorts of people and ideas and thus should be making us more open in our beliefs…

… we tend to hang out with people who are like us. It’s a natural human thing to prefer people “like us,” or “people we’re comfortable with.” And this leads to confirmation bias — our existing beliefs get reinforced — and possibly to polarization, in which our beliefs become more extreme.

This is known as the echo chamber problem, and it’s a real problem. I personally think it’s been overstated, but it is definitely there.

So there are four problems with networked knowledge. Not one of them is new. Each has a analog from before the Net.

  1. The loss of context has always been with us. Most of what we believe we believe because we believe it, not because of evidence. At its best we call it, in English, common sense. But history has shown us that common sense can include absurdities and lead to great injustices.

  2. Yes, the Net is not a flat, totally equal place. But it is far less centralized than the old media were, where only a handful of people were allowed to broadcast their ideas and to choose which ideas were broadcast.

  3. Certainly the Internet tends towards overstatement. But we have had mass media that have been built on running over-stated headlines. This newspaper [Weekly World News] is a humor paper, but it’s hard to distinguish from serious broadcast news.

  4. And speaking of Fox, yes, on the Internet we can simply stick with ideas that we already agree with, and get more confirmed in our beliefs. But that too is nothing new. The old media actually were able to put us into even more tightly controlled echo chambers. We are more likely to run into opposing ideas — and even just to recognize that there are opposing ideas — on the Net than in a rightwing or leftwing newspaper.

It’s not simply that all the old problems with knowledge have reemerged. Rather, they’ve re-emerged in an environment that offers new and sometimes quite substantial ways around them.

  1. For example, if something loses its context, we can search for that context. And links often add context.

  2. And, yes, the Net forms hubs, but as Clay Shirky and Chris Anderson have pointed out, the Net also lets a long tail form, so that voices that in the past simply could not have been heard, now can be. And the activity in that long tail surpasses the attention paid to the head of the tail.

  3. Yes, we often tend to overstate things on the Net, but we also have a set of quite powerful tools for pushing back. We review our reviews. We have sites the well-regard American site, Snopes.com, that will tell you if some Internet rumor is true. And it’s highly reliable. Then we have all of the ways we talk with one another on the Net, evaluating the truth of what we’ve read there.

  4. And, the echo chamber is a real danger, but we also have on the Net the occasional fulfillment of our old ideal of being able to have honest, respectful conversations with people with whom we fundamentally disagree. These examples are from Reddit, but there are others.

So, yes, there are problems of knowledge that persist even when our technology of knowledge changes. That’s because these are not technical problems so much as human problems…

…and thus require human solutions. And the fundamental solution is that we need to become more self-aware about knowledge.

Our old technology — paper — gave us an idea of knowledge that said that knowledge comes from experts who are filtered, printed, and then it’s settled, because that’s how books work. Our new technology shows us we are complicit in knowing. In order to let knowledge get as big as our new medium allows, we have to recognize that knowledge comes from all of us (including experts), it is to be linked, shared, discussed, argued about, made fun of, and is never finished and done. It is thoroughly ours – something we build together, not a product manufactured by unknown experts and delivered to us as if it were more than merely human.

The required human solution therefore is to accept our human responsibility for knowledge, to embrace and improve the technology that gives knowledge to us –- for example, by embracing Open Access and the culture of linking and of the Net, and to be explicit about these values.

Becoming explicit is vital because our old medium of knowledge did its best to hide the human qualities of knowledge. Our new medium makes that responsibility inescapable. With the crumbling of the paper authorities, it bcomes more urgent than ever that we assume personal and social responsibility for what we know.

Knowing is an unnatural act. If we can remember that –- remember the human role in knowing — we now have the tools and connections that will enable even everyday knowledge to scale to a dimension envisioned in the past only by the mad and the God-inspired.

Thank you.

1 Comment »

May 15, 2013

[meshcon] Ryan Carson of Treehouse

Ryan Carson [twitter:RyanCarson] of Treehouse at the Mesh Conference is keynoting the Mesh Conference. He begins his introduction of himself by saying he is a father, which I appreciate. Treehouse is an “online education company that teaches technology. We hope we can remove the need to go to university to do technology.”

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

Treehouse “treasures personal time.” They work a 4-day week, 8 hours a day, although they pay for a full 40-hour week. He asks how many people in the audience work for themselves or run their own company; half the people raise their hands. “We have a fundamental belief that people can work smarter, and thus faster…We use a lot of tools that decrease drag.” E.g., they have an internal version of Reddit called “Convoy.” It keeps conversation out of email. “We ask people to never put anything in email that isn’t actionable.” A 4 day week also makes recruiting easy.

“As a father, I realize I’m going to die, sooner rather than later. If I work four days a week, I can send 50% more of my life with my wife and kids.”

Q: Why not a 3 day week?

A: It’s a flag to say “We believe personal time is important.” We’ll do whatever we have to. I’ve told people not to send email over the weekend because it makes work for others.

Q: How about flex time instead?

A: We have tried that, and we let people work from home. “People are smart and motivated and want to succeed. We presume that about people.” We’re demanding, and we’ll fire people if they don’t perform. But you have to institute practices, and not just say that you believe in personal time.

Q: Do you have investors? How do they respond?

A: We have $12M in investment. But we didn’t raise money until after we were profitable. I used my experience running 3 prior companies to give investors confidence. And no one asked about the 4 day week. It doesn’t seem to matter to them. My prior company was an events company and it got bought by a company that worked 5 days a week, and it was messy. I think our team there is now working 5 days.

Q: How do you provide 7 day a week support?

A: Our support team time shifts.

Q: How do you control email so that it’s only actionable?

A: It’s a policy. Also, we use Boomerang which lets us schedule when email is sent.

Now Ryan talks about the tools they use to facilitate a distributed team: about 30 people in Orlando, 8 in Portland, and the rest are distributed in the US and UK. “We don’t have a headquarters.” We are an Internet company. We use Convoy: part water cooler, part news distribution. Notes from meetings go there. It took a dev about a day to create Convoy.

We also use Campfire, a chat program. And Trello for task management. And Google Hangouts. (He notes that you have to be wired, not wifi, and have good gear, for Hangouts to work well.)

Q: Do you have to work over the weekend when there’s a hard deadline? And do you put more of an emphasis on planning?

A: Yes, we sometimes have worked over the weekend. And we’ve sometimes had a problem with people working too much. I think some people work without telling us, especially developers and designers. But if they have to work, their managers have failed. And it does mean we have to plan carefully.

Q: What are your annual meetups like?

A: It’s a full week. No agenda, no working. Pure get drunk, have fun. People work much harder if they like each other and believe in each other.

Now on education. By 2020, there will be 1,000,000 jobs in tech than students. Nine out of ten high schools don’t even offer computer programming classes. [Really? Apparently so. Wow.] Treehouse tries to address this, along with Udacity, CodeAcademy, Code School. In a video, Ryan says that Treehouse will cost you about $300 for an entire course of tech education, making you ready to enter the workforce. “The education system is a racket. Universities have milked us dry for ten years.” 40% of jobs in STEM are in computer science, but only 2% of STEM students are studying it. “In 41 out of 50 states coding classes don’t count toward high school graduation math or science requirements.” “In the future, most students won’t get a four year degree, and I think that’s a good thing. We are moving toward a trade school model.”

Q: Many companies use college degrees as a filter. How do you filter?

A: In 5 yrs there won’t be enough graduates for you to hire anyone because Google and FB will pay them $500,000/year. At Treehouse we apply points. You can see someone’s skills.

Q: What will people miss out on if they don’t go to college?

A: People will miss out on the social aspect, but people can’t afford to go into debt for that. College as the next step is a new idea in the past 15 years. [Really?] You’ll have free liberal arts education available through free online courses. You’ll pay for trade school training. “We’ll just have to have faith that people can be responsible adults without going to university.”

Q: How do you help people who complete your courses find job?

A: We’re rolling out an entire department for this. As you learn on Treehouse, you get points and start to establish your rank. Employers will be able to search our database saying, e.g., “I want someone with over 1,000 points in CSS, 800 points in Javascript, and 500 points in business.”

Q: How are you going to mesh these ideas into traditional education?

A: Sub-par universities will die. Education will be completely different in 10 years. We don’t know what it will be.

Ryan says that he’s not doing this for the money. “People who need education can’t afford it.”

[Judy Lee tweeted that Ryan should have asked us how many in the audience have a university degree, and how many of us regret it. Nice.]

3 Comments »

April 9, 2013

[berkman] Derek Khanna on connecting the dots

Derek Khanna is giving a Berkman talk on trying to connect the dots so that policy-makers “get it.” “How do we even frame discussions about the economy and innovation?” Copyright law hasn’t been re-assessed in at least 15 yrs, he says. He begins with his bakcstory: He’s from Mass. Worked for Romney and Scott Brown. (Derek wrote the copyright reform report for the Republican Study Group.)

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

Rule 1: “Being right is just part of the battle.” Rule 2: “It’s less important what you say…It’s most important who says it.” Rule 3: “Control the framing of the issue.” E.g., we [copyright reformers] frame copyright very differently than does Capitol Hill.

Take SOPA. He quotes Adam Green saying it’s not a matter of right vs. wrong but old vs. new. Staffers had been warning about SOPA, but suddenly the public engaged. The result was astounding: Co-sponsors became opponents of the bill. Derek says it wasn’t Google that killed SOPA. It was the 3 million people reaching out to Congress that killed it. “People like Elizabeth Stark, Alexis Ohanian [reddit] and Aaron Swartz.” The RIAA and MPAA like to frame it as having lost to Google rather than having lost to the American people. (He points to a Mario Savio speech that begins “There’s a time when the operation of the machine becomes so odious…”) SOPA remains very much on Congress’ mind, he says.

The framing was “perfect”: SOPA will censor the Internet and inhibit innovation.

Most conversations about copyright are framed as: Piracy is rampant, costing American jobs. Content is a crucial export, “the only thing produced in US any more.” Copyright is thus good, but more copyright is better.

Derek set out to reframe it in his “Three Myths of Copyright.” At a panel he asked “Who thinks terrorism is bad? Who thinks the TSA is only the way to protect us?” Likewise, is copyright the only way to protect content when it makes 23M Americans into felons? He points to the difference between the original copyright law and the current one. To conservatives, it can be framed as looking like a wild divergence from the original intent.

The “Three Myths” memo went out and was supported by conservatives until 24 hours later when it was pulled. A few weeks later, Derek was fired. He’s continuing but he thinks that when you’re on the outside, you have to fight small, strategic battles.

Idea + Movement + Effort = Legislation

A few weeks ago the head of the copyright office endorsed many of the reforms in “Three Myths,” updating copyright for the digital generation. The day before the content industry made the old argument in Roll Call. The other side isn’t countering. The content lobby knows that Roll Call is read by Congress. We need similar expertise.

How do we start?

  1. Don’t wait for the next SOPA. They’re going to be much subtler in how they do it next time. Sites are still being taken down, e.g., Megaupload. Also funding mechanisms were cut off for ThePiratesBay. Also, Google was forced to take down links to torrents, etc. So, why would they come up with another SOPA? Instead they’re using international treaties to codify the DMCA forever, using stock language that gets replicated in treaties. These treaties only require Senate approval, or through executive actions. Therefore, we have to be more activist.

  2. We have to analyze existing law.

  3. We need support from both the left and the right

  4. We need to focus on areas of common interest where we can form a collective whole

  5. Asymmetrical warfare: Where are we strong and they’re weak? Where have they overplayed their hand? E.g., if you want to take on copyright law, that’s not asymmetric because there’s a strong argument on the other side.

“We lack the institutional capacity to quickly intervene in the political process in the way the content industry has. We therefore need to be smarter and more tactical.” We should start with smaller battles. We should avoid the narrative of “fighting the Man,” that companies are evil, etc. That won’t win over a party that sees itself as a party of business. “Instead, foster a David v. Goliath narrative.” That media like that narrative.

We should not talk about piracy. And even if the DMCA needs to be replaced, that’s a non-starter on Capital Hill.

Derek’s first campaign was on cellphone unlocking, after the Librarian of Copyright said it was now illegal (i.e., ending the DMCA exemption) to enable your phone to be used on a different carrier. Unlocking would increase competition among carriers. Derek wrote an article for The Atlantic that pointed out that the technology for the blind also has to be exempted every three years, a clear example of how the system is broken. Derek expected this issue to be hard. It didn’t get any mainstream media attention. It has a $32M lobbying effort on the other side. “That’s a problem on Capitol Hill: We don’t have a lobby for the future.” IT requires making hypothetical arguments.

But as the argument went on, examples emerged. E.g., Republic Wireless offers very cheap connectivity, but it depends on users bringing in unlocked phones.

Derek started a White House petition that got 114,000 signatures, the largest at the time. In part this worked because of people’s prior experience with SOPA. There were positive arguments on Left and Right. Left: It’s a matter of fairness. Right: Property rights. Derek added to this the value of innovation as a cross-party value.

After the petition, the FCC announced an investigation, and the White House came out in favor of unlocking. Before that, Derek had urged Congressfolks to come out in favor of it, if only because he was worried that after Obama came out in favor of repeal, the right would take the other side. But shortly after Obama endorsed, some conservatives came out in favor. Bills were introduced in both chambers.

Unfortunately, we have no way of mobilizing the 114,000 people who signed the petition; the names couldn’t be captured.

Why was it successful?

  1. They made it simple. (Also with SOPA: SOPA = censorship)

  2. Leveraged social media

  3. Utilized video

  4. Created a diverse coalition

  5. Gained mainstream credibility

  6. Channeled energies into a measurable demonstration of support

  7. Kept Congress in the loop

  8. Solid media narrative

  9. Avoided talking about piracy. Instead: competition, innovation, and property rights.

  10. It was unfair

  11. Everyone has a phone…

Derek presented this at a conservative org and got called a Marxist. Fox Business also: “You’re just against contracts.” “When you take up an issue, you have to know where your third rails are.” Response: The contract is between you and your carrier; the feds shouldn’t be arresting people for violating a contract.

Why is it important? It’s the first time Congress has questioned the DMCA. We might get a hearing on it. Congress is unaware of the implications of the DMCA. It also helped Congress realize that international treaties are being used as a backdoor for these restrictions. It may affect the Trans-Pacific Partnership treaty. And it helped identify allies.

Bottom line: “A free society shouldn’t have to petition its govt every 3 years to allow access to tech.” It’s akin to free speech, he says.

On the CFAA: “The statute is terrible.” There’s consensus about this. “But no one has written about in Weekly Standard or Politico.” It hasn’t reached Congress’ attention. Most members of Congress think that the sky is falling when it comes to cybersecurity. Every time a cybersec bill comes up, Congress has experts telling them that we are in deep peril. “Essentially the arguments for CFAA are that we need to reduce the DoJ’s discretion.” You have to defeat that training. Meet with Rogers or McCain or the other cyber-hawks and convince them that the CFAA needs to be reformed, that we can target hacking with a more narrowly focused bill.

Q&A

Q: Can we try to drive a wedge in the opposition?

A: Yes. The RIAA’s and MPAA’s policies don’t foster innovation in their own industry. Over a 100 wireless carriers supported us on unlocking.

Q: You said that people who “get” tech are on the side of openness, etc. That optimistically suggests that if we educate people, they’ll take more common sense positions on tech.

A: Not entirely. Congress listens to people they trust, who are the RIAA, MPAA…

Q: …But even if Congressfolks fully understood tech, would the funds they get from the content industry still sway them?

A: Yes, some understand and still oppose us. But the ones who understand generally agree with us. The story is more complex: The MPAA/RIAA are very liberal, but the right still tend toward copyright protection.

Q: Why is the content industry so powerful, given the size of Google, etc.

A: AT&T and Verizon are both in the top ten of lobbying companies: $32M. Google spends about $6M on lobbying. “No tech company had a DC presence until Microsoft” when it was about to be broken up. Also, as the tech companies invest heavily to survive, say, patent law, why would you favor wholesale patent law change? Also, when the RIAA/MPAA sue kids, the money goes back into lobbying, not to the artists. They’re self-funding. But the tech industry has to justify why they’re spending money on lobbying.

Q: In Pakistan, piracy is rampant. Doesn’t that hurt innovation?

A: Piracy is real. But, those generally weren’t loss sales. The obsession with piracy is the problem.

Q: How about the role of public interest groups?

A: I’m a big fan of Public Knowledge and EFF, etc. But they need supplementing with more activist movements.

Q: If we focus on small victories, will people think we’re not doing enough? Will you have to keep winning bigger and bigger?

A: You can exist at a level for a while, if you’re strategic about it. Eventually you have to move on to bigger battles.

Q: How about the importance of multistake partnerships?

A: You need as many allies as you can. E.g., I’m interested in orphan works: in copyright but you can’t find the copyright holders. Our interests are in line with the RIAA.

A: Are we in a moment like the environmental movement before it formed under a single banner?

Q: I’m not an expert on the environmental movement. There are lots of lessons to be learned from them.

Q: Is there a schism in the conservatism over copyright reform?

A: I haven’t seen much of a schism. The best argument I’ve heard is the natural rights one: copyright ought to exist forever. But that’s not the system we’ve adopted. Our founding fathers rejected it. I’d like to build a cross-party coalition, but that’s a longtime goal.

Q: Did you get pushback on using the WH petition mechanism?

A: I got some from privacy folks.

Q: When we win a battle, the other side comes up with something more drastic. E.g., we won a first sale argument, but the right may be preparing something much more drastic. How can we avoid that?

A: I’m not sure they’re going to try to reverse the first sale doctrine, but we need to have our eyes open.

Q: What should we do right now?

A: We’d like to start to bring together the CISPA coalition.

3 Comments »

April 2, 2013

[berkman] Anil Dash on “The Web We Lost”

Anil Dash is giving a Berkman lunchtime talk, titled “The Web We Lost.” He begins by pointing out that the title of his talk implies a commonality that at least once was.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

[Light editing on April 3 2013.]

Anil puts up an icon that is a symbol of privately-owned public spaces in New York City. Businesses create these spaces in order to be allowed to build buildings taller than the zoning requirements allow. These are sorta kinda like parks but are not. E.g., Occupy isn’t in Zuccotti Park any more because the space is a privately-own public space, not a park. “We need to understand the distinction” between the spaces we think are public and the ones that are privately owned.

We find out about these when we transgress rules. We expect to be able to transgress in public spaces, but in these privately-owned spaces we cannot. E.g., Improv Everywhere needs to operate anonymously to perform in these spaces. Anil asks us to imagine “a secretive, private ivy league club.” He is the son of immigrants and didn’t go to college. “A space even as welcoming as this one [Harvard Berkman] can seem intimidating.” E.g., Facebook was built as a private club. It welcomes everyone now, but it still doesn’t feel like it’s ours. It’s very hard for a business to get much past its origins.

One result of online privately-owned public spaces is “the wholesale destruction of your wedding photos.” When people lose them in a fire, they are distraught because those photos cannot be replaced. Yet everyday we hear about a startup that “succeeds” by selling out, and then destroying the content that they’d gathered. We’ve all gotten the emails that say: “Good news! 1. We’re getting rich. 2. You’re not. 3. We’re deleting your wedding photos.” They can do this because of the terms of service that none of us read but that give them carte blanche. We tend to look at this as simply the cost of doing business with the site.

But don’t see it that way, Anil urges. “This is actually a battle” against the values of the early Web. In the mid to late 1990s, the social Web arose. There was a time when it was meaningful thing to say that you’re a blogger. It was distinctive. Now being introduced as a blogger “is a little bit like being introduced as an emailer.” “No one’s a Facebooker.” The idea that there was a culture with shared values has been dismantled.

He challenges himself to substantiate this:

“We have a lot of software that forbids journalism.” He refers to the IoS [iphone operating system] Terms of Service for app developers that includes text that says, literally: “If you want to criticize a religion, write a book.” You can distribute that book through the Apple bookstore, but Apple doesn’t want you writing apps that criticize religion. Apple enforces an anti-journalism rule, banning an app that shows where drone strikes have been.

Less visibly, the laws is being bent “to make our controlling our data illegal.” All the social networks operate as common carriers — neutral substrates — except when it comes to monetizing. The boundaries are unclear: I can sing “Happy Birthday” to a child at home, and I can do it over FaceTime, but I can’t put it up at YouTube [because of copyright]. It’s very open-ended and difficult to figure. “Now we have the industry that creates the social network implicitly interested in getting involved in how IP laws evolve.” When the Google home page encourages visitors to call their senators against SOPA/PIPA, we have what those of us against Citizens United oppose: we’re asking a big company to encourage people to act politically in a particular way. At the same time, we’re letting these companies capture our words and works and put them under IP law.

A decade ago, metadata was all the rage among the geeks. You could tag, geo-tag, or machine-tag Flickr photos. Flickr is from the old community. That’s why you can still do Creative Commons searches at Flickr. But you can’t on Instagram. They don’t care about metadata. From an end-user point of view, RSS is out of favor. The new companies are not investing in creating metadata to make their work discoverable and shareable.

At the old Suck.com, hovering on a link would reveal a punchline. Now, with the introduction of Adlinks and AdSense, Google transformed links from the informative and aesthetic, to an economic tool for search engine optimization (SEO). Within less than 6 months, linkspam was spawned. Today Facebook’s EdgeRank is based on the idea that “Likes” are an expression of your intent, which determines how FB charges for ads. We’ll see like-spammers and all the rest we saw with links. “These gestural things that were editorial or indicators of intent get corrupted right away.” There are still little islands, but for the most part these gestures that used to be about me telling you that I like your work are becoming economic actions.

Anil says that a while ago when people clicked on a link from Facebook to his blog, FB popped up a warning notice saying that it might be dangerous to go there. “The assumption is that my site is less trustworthy than theirs. Let’s say that’s true. Let’s say I’m trying to steal all your privacy and they’re not.” [audience laughs] He has FB comments on his site. To get this FB has to validate your page. “I explicitly opted in to the Facebook ecology” in part to prove he’s a moderate and in part as a convenience to his readers. At the same time, FB was letting the Washington Post and The Guardian publish within the FB walls, and FB never gave that warning when you clicked on their links. A friend at FB told Anil that the popup was a bug, which might be. But that means “in the best case, we’re stuck fixing their bugs on our budgets.” (The worst case is that FB is trying to shunt traffic away from other sites.)

And this is true for all things that compete with the Web. The ideas locked into apps won’t survive the company’s acquisition, but this is true when we change devices as well. “Content tied to devices dies when those devices become obsolete.” We have “given up on standard formats.” “Those of us who cared about this stuff…have lost,” overall. Very few apps support standard formats, with jpg and html as exceptions. Likes and follows, etc., all use undocumented proprietary formats. The most dramatic shift: we’ve lost the expectation that they would be interoperable. The Web was built out of interoperability. “This went away with almost no public discourse about the implications of it.”

The most important implication of all this comes when thinking about the Web as a public space. When the President goes on FB, we think about it as a public space, but it’s not, and dissent and transgression are not permitted. “Terms of Service and IP trump the Constitution.” E.g., every single message you put on FB during the election FB could have transformed into its opposite, and FB would be within its ToS rights. After Hurricane Sandy, public relief officials were broadcasting messages only through FB. “You had to be locked into FB to see where public relief was happening. A striking change.”

What’s most at risk are the words of everyday people. “It’s never the Pharaoh’s words that are lost to history.” Very few people opt out of FB. Anil is still on FB because he doesn’t want to lose contact with his in-laws. [See Dan Gillmor's talk last week.) Without these privately-owned public spaces, Anil wouldn't have been invited to Harvard; it's how he made his name.

"The main reason this shift happened in the social web is the arrogance of the people who cared about the social web in the early days...We did sincerely care about enabling all these positive things. But the way we went about it was so arrogant that Mark Zuckerberg's vision seemed more appealing, which is appalling." An Ivy League kid's software designed for a privileged, exclusive elite turned out to be more appealing than what folks like Anil were building. "If we had been listening more, and a little more open in self-criticism, it would have been very valuable."

There was a lot of triumphalism after PIPA/SOPA went down, but it took a huge amount of hyperbole: "Hollywood wants to destroy the First Amendment, etc." It worked once but it doesn't scale. The willingness to pat ourselves on our back uncritically led us to vilify people who support creative industries. That comes from the arrogance that they're dinosaurs, etc. People should see us being publicly critical of ourselves. For something to seem less inclusive than FB or Apple — incredibly arrogant, non-egalitarian cultures — that's something we should look at very self-critically.

Some of us want to say "But it's only some of the Web." We built the Web for pages, but increasingly we're moving from pages to streams (most recently-updated on top, generally), on our phones but also on bigger screens. Sites that were pages have become streams. E.g., YouTube and Yahoo. These streams feel like apps, not pages. Our arrogance keeps us thinking that the Web is still about pages. Nope. The percentage of time we spend online looking at streams is rapidly increasing. It is already dominant. This is important because these streams are controlled access. The host controls how we experience the content. "This is part of how they're controlling the conversation." No Open Web advocate has created a stream that's anywhere near as popular as the sites we're going to. The geeks tend to fight the last battle. "Let's make an open source version of the current thing." Instead, geeks need to think about creating a new kind of stream. People never switch to more open apps. (Anil says Firefox was an exception.)

So, what do we do? Social technologies follow patterns. It's cyclical. (E.g., "mainframes being rebranded as The Cloud.") Google is doing just about everything Microsoft was doing in the late 1990s. We should expect a reaction against their overreach. With Microsoft, "policy really worked." The Consent Decree made IE an afterthought for developers. Public policy can be an important of this change. "There's no question" that policy over social software is coming.

Also, some "apps want to do the right thing." Anil's ThinkUp demonstrates this. We need to be making apps that people actually want, not ones that are just open. "Are you being more attentive to what users want than Mark Zuckerberg is?" We need to shepherd and coach the apps that want to do the right thing. We count on 23 yr olds to do this, but they were in 5th grade when the environment was open. It's very hard to learn the history of the personal software industry and how it impacted culture. "What happened in the desktop office suite wars ?" [Ah, memories!] We should be learning from such things.

And we can learn things from our own data. “It’s much easier for me to check my heart-rate than how often I’m reading Twitter.”

Fortunately, there are still institutions that care about a healthy Web. At one point there was a conflict between federal law and Terms of Service: the White House was archiving coments on its FB wall, whereas FB said you couldn’t archive for more than 24 hrs.

We should remember that ToS isn’t law. Geeks will hack software but treat ToS as sacred. Our culture is negatively impacted by ToS and we should reclaim our agency over them. “We should think about how to organize action around specific clauses in ToS.” In fact, “people have already chosen a path of civil disobedience.” E.g., search YouTube for “no infringement intended.” “It’s like poetry.” They’re saying “I’m not trying to step on your toes, but the world needs to see this.” “I’m so inspired by this.” If millions of teenagers assembled to engage in civil disobedience, we’d be amazed. They do on line. They feel they need to transgress because of a creative urge, or because it’s speech with a friend not an act of publishing. “That’s the opportunity. That’s the exciting part. People are doing this every single day.

[I couldn't capture the excellent Q&A because I was running the microphone around.]

 


The video of the talk will be posted here.

35 Comments »

March 28, 2013

[annotation][2b2k] Mediathread

Jonah Bossewich and Mark Philipsonfrom Columbia University talk about Mediathread, an open source project that makes it easy to annotate various digital sources. It’s used in many courses at Columbi, as well as around the world.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

It comes from Columbia’s Center for New Media Teaching and Learning. It began with Vital, a video library tool. It let students clip and save portions of videos, and comment on them. Mediathread connects annotations to sources by bookmarking, via a bookmarklet that interoperates with a variety of collections. The bookmarklet scrapes the metadata because “We couldn’t wait for the standards to be developed.” Once an item is in Mediathread, it embeds the metadata as well.

It has always been conceived of a “small-group sharing and collaboration space.” It’s designed for classes. You can only see the annotations by people in your class. It does item-level annotation, as well as regions.

Mediathread connects assignments and responses, as well as other workflows. [He's talking quickly :)]

Mediathread’s bookmarklet approach requires it to have to accommodate the particularities of sites. They are aiming at making the annotations interoperable in standard forms.

Be the first to comment »

[annotation][2b2k] Paolo Ciccarese on the Domeo annotation platform

Paolo Ciccarese begins by reminding us just how vast the scientific literature is. We can’t possibly read everything we should. But “science is social” so we rely on each other, and build on each other’s work. “Everything we do now is connected.”

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

Today’s media do provide links, but not enough. Things are so deeply linked. “How do we keep track of it?” How do we communicate with others so that when they read the same paper they get a little bit of our mental model, and see why we found the article interesting?

Paolo’s project — Domeo [twitter:DomeoTool] — is a web app for “producing, browsing, and sharing manual and semi-automatic (structure and unstructured) annotations, using open standards. Domeo shows you an article and lets you annotate fragments. You can attach a tag or an unstructured comment. The tag can be defined by the user or by a defined ontology. Domeo doesn’t care which ontologies you use, which means you could use it for annotating recipes as well as science articles.

Domeo also enables discussions; it has a threaded msg facility. You can also run text mining and entity recognition systems (Calais, etc.) that automatically annotates the work with those words, which helps with search, understanding, and curation. This too can be a social process. Domeo lets you keep the annotation private or share it with colleagues, groups, communities, or the Web. Also, Domeo can be extended. In one example, it produces information about experiments that can be put into a database where it can be searched and linked up with other experiments and articles. Another example: “hypothesis management” lets readers add metadata to pick out the assertions and the evidence. (It uses RDF) You can visualize the network of knowledge.

It supports open APIs for integrating with other systems., including into the Neuroscience Information Framework and Drupal. “Domeo is a platform.” It aims at supporting rich source, and will add the ability to follow authors and topics, etc., and enabling mashups.

Be the first to comment »

[annotation][2b2k] Neel Smith: Scholarly annotation + Homer

Neel Smith of Holy Cross is talking about the Homer Multitext project, a “long term project to represent the transmission of the Iliad in digital form.”

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

He shows the oldest extant ms of the Iliad, which includes 10th century notes. “The medieval scribes create a wonderful hypermedia” work.

“Scholarly annotation starts with citation.” He says we have a good standard: URNs, which can point to, for example, and ISBN number. His project uses URNs to refer to texts in a FRBR-like hierarchy [works at various levels of abstraction]. These are semantically rich and machine-actionable. You can google URN and get the object. You can put a URN into a URL for direct Web access. You can embed an image into a Web page via its URN [using a service, I believe].

An annotation is an association. In a scholarly notation, it’s associated with a citable entity. [He shows some great examples of the possibilities of cross linking and associating.]

The metadata is expressed as RDF triples. Within the Homer project, they’re inductively building up a schema of the complete graph [network of connections]. For end users, this means you can see everything associated with a particular URN. Building a facsimile browser, for example, becomes straightforward, mainly requiring the application of XSL and CSS to style it.

Another example: Mise en page: automated layout analysis. This in-progress project analyzes the layout of annotation info on the Homeric pages.

1 Comment »

[annotations][2b2k] Rob Sanderson on annotating digitized medieval manuscripts

Rob Sanderson [twitter:@azaroth42] of Los Alamos is talking about annotating Medieval manuscripts.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

He says many Medieval manuscripts are being digitized. The Mellon Foundation is funding many such projects. But these have tended to reinvent the same tech, and have not been designed for interoperability with other projects. So the Digital Medieval Initiative was founded, with a long list of prestigious partners. They thought about what they’d like: distributed, linked data, interoperable, etc. For this they need a shared description format.

The traditional approach is annotate an image of a page. But it can be very difficult to know which images to annotate; he gives as an example a page that has fold-outs. “The naive assuption is that an image equals a page.” But there may be fragments, or only portions of the page have been digitized (e.g., the illuminations), etc. There may be multiple images on a page, revealed by multi-spectral imaging. There may be multiple orientations of the page, etc.

The solution? The canvas paradigm. A canvas is an empty space corresponding to the rectangle (or whatever) of the page. You allow rich resources to be associated with it, and allow users to comment. For this, they use Open Annotation. You can specify a choice of images. You can associate text with an area of the canvas. There are lots of different ways to visualize those comments: overlays, side-by-side, etc.

You can build hybrid pages. For example, and old scan might have a new color scan of its illustrations pointing at it. Or you could have a recorded performance of a piece of music pointing at the musical notation.

In summary, the SharedCanvas model uses open standards (HTML 5, Open Annotation, TEI, etc.) and can be implement distributed across reporsitories, encouraging engagement by domain experts.

Be the first to comment »

[annotation][2b2k] Philip Desenne

I’m at a workshop on annotation at Harvard. Philip Desenne is giving one of the keynotes.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

We’re here to talk about the Web 3.0, Phil says — making the Web more fully semantic.

Phil says that we need to re-write the definition of annotation. We should be talking about hyper-nota: digital media-rich annotations. Annotations are important, he says. Try to imagine social networks with the ratings, stars, comments, etc. Annotations also spawn new scholarship.

The new dew digital annotation paradigm is the gateway to Web 3.0: connecting knowledge through a common semantic language. There are many annotation tools out there. “All are very good in their own media…But none of them share a common model to interoperate.” That’s what we’re going to work on today. “The Open Annotation Framework” is the new digital paradigm. But it’s not a simple model because it’s a complex framework. Phil shows a pyramid: Create / Search / Seek patterns / Analyze / Publish / Share. [Each of these has multiple terms and ideas that I didn't have time to type out.]

Of course we need to abide by open standards. He points to W3C, Open Source and Creative Commons. And annotations need to include multimedia notes. We need to be able to see annotations relating to one another, building networks across the globe. [Knowledge networks FTW!] Hierarchies of meaning allow for richer connections. We can analyze text and other media and connect that metadata. We can look across regional and cultural patterns. We can publish, share and collaborate. All if we have a standard framework.

For this to happeb we beed a standardized referencing system for segments or fragments of a work. We also need to be able to export them into standard formats such as XML TEI.

Lots of work has been done on this: RDF Models and Ontologies, the Open Annotiation Community Group, the Open Annotation Model. “The Open Annotation Model is the common language.”

If we don’t adopt standards for annotation we’ll have disassociated, stagnant info. We’ll dereased innovation research, teaching, and learning knowledge. This is especially an issue when one thinks about MOOCs — a course with 150,000 students creating annotations.

Connective Collective Knowledge has existed for millennia he says. As far back as Aristarchus, marginalia had ymbols to allow pointing to different scrolls in the Library of Alexandria. Where are the connected collective knowledge systems today? Who is networking the commentaries on digital works? “Shouldn’t this be the mission of the 21st century library?”

Harvard has a portal for info about annotations: annotations.harvard.edu

2 Comments »

February 5, 2013

[berkman] Diana Kimball: Coding as a Liberal Art

Diana Kimball [twitter:dianakimball] is giving a Berkman lunchtime talk on coding as a liberal art. She’s a Berkman Fellow and at the Harvard Business School. (Here are some of her posts on this topic.)

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

She says that she’s loved computers since she was a kid. But when she went to Harvard as an undergrad she decided to study history, in part because there’s a natural specialization that happens in college: the students who come in as coders are fantastic at coding, whereas Diana had greater strengths as a writer of prose. She found HTML and programming intimidating. But in her third year, she got interested in coding and Internet culture. She was one of the founders of ROFLcon [yay!]. She got hired by Microsoft after college, as a technical product manager with the Powerpoint team in Silicon Valley. “This was culture shock in the best possible way.”

When she graduated in 2009, she and some friends started found the SnarkMarket blog that considers what the new liberal arts might be (inspired by Kottke). She wrote an essay that’s a proposal for coding and decoding. She reads it. (It’s short.) An excerpt:

Coding and Decoding is about all modes of communication, and all are in its view. But it is built with particular attention to the future, and what that future will be like. Technological experts can seem like magicians, conjuring effects wordlessly. By approaching that magic as a collection of component parts instead of an indivisible miracle, we can learn to see through these sleights of typing hands. In seeing through, we will learn to perform them ourselves; and think, as magicians, about the worlds we will build.

Language, now, is about more than communication. It is the architecture behind much of what we experience. Understanding that architecture will allow us to experience more.

Her boyfriend taught her how to code. They spent a lot of time on it. “He picked up on something I’d said and took it seriously.” After two years at Microsoft, she was enthusiastic, but still a beginner. It wasn’t until she started at Harvard Business School that coding really took off for her. The entrepreneurial atmosphere encouraged her to just do it. Plus, she was more of a geek than most of the other students. “This was great for my identity, and for my confidence.” She also found it a social refuge. “It takes a lot of time to get over the hump.” She refers to Ellen Ullman’s “Close to the Machine” that talks about the utility of being arrogant enough to obsess over a project, cycling back to humility.

She decided to code up her own site for a project for school, even though the team had been given the money to hire devs for the task. Last fall she took the famous CS50 course [Harry Lewis, who created the course in about 1981, is sitting next to me.] CS50 teaches C, targeted at people who are either taking only that one class, or are going to take many more. For her final project, she did a project that used multiple APIs that she was very proud of. She’s also proud of her Ruby projects folder. Each project is something she was trying to teach herself. She’s more proud of the list than the finished products.

“Learning to code means reclaiming patience and persistence and making them your stubborn own.” [nice]

Ideally, everyone should be exposed to programming, starting at 5 yrs old, or even earlier, Diana says. Seymore Papert’s “Mind-Storms” has greatly influenced her thinking about how coding fits into education and citizenship. At a university, it ought to be taken as a liberal art. She quotes Wikipedia’s definition. And if “grammar, rhetoric, and logic were the core of the liberal arts,” then that’s sound like coding. [Hmm.] What the law was to the liberal arts, programming ought to be, i.e., that which you try if you don’t know what else to do with your liberal arts degree.

Why isn’t it seen that way? When computer scientists teach you, they teach they way they learned: at school. But many of the best programmers are self-taught. CS50 does give a variety of assignments, but it’d be better if students solved their own problems much earlier.

But the number one problem is the academic attitude, she says. Students get fixated on the grade, even when it doesn’t matter. Coding is critical for children because debugging is part of it, as Papert says. But grades are based on the endpoints. Coding is much more like life: You’re never done, you can always make it better.

Diana has a proposal. Suppose coding classes were taught like creative writing workshops. Take it whenever you’re ready. Taught by hackers, esepcially autodidacts. It’d vary in substance — algorithms, apis, etc. — and you’d get to choose. You’d get to see something on screen that you’d never seen before And you’d be evaluated on ingenuity and persistence, rather than only on how well your code runs.

She says what her syllabus would look like:

“Coding should be taught in the same breath as expository writing… Everyone deserves to be exposed to it.” She’s not sure if it should be required.

She quotes Papert: “…the most powerful idea of all is the idea of powerful ideas.” There’s no better example of this, she says, than open source software. And David Foster Wallace’s commencement address: “Learning how to think really means learning to exercise some control over how and what you think…If you cannot exercise this sort of choice in adult life, you will be totally hosed.” Diana says that’s her. She was wrapped up in writing from an early age. She has a running internal commentary. [Join the club!] Coding swaps in a different monologue, one in which she’s inventing thing. That’s the greatest gift: her internal monologue is much more useful and interesting. “If you wanted to be a novelist in 1900, you’d want to be a programmer today.” The experience of creating something that people use is so motivating.

Q&A

Q: Would you be willing to webcast yourself programming and let people join in? I do this all the time when at hackathons. I think, OMG, there must be 10,000 kids in India who want to be here. And so here they are. “Hackers at Berkeley” does this really well.

A: That’s awesome. I want more people to have more access to that experience of sharing.

Q: Are you familiar with RailsBridge — non-computer scientists who are teaching themselves how to code via weekend workshops.

A: RailsBridge is extraordinary. It’s great to see this happening outside of the university context.

A: [me] Great talk, and I’m a humanities major who spends most of his hobby time programming. But aren’t you recommending the thing that you happen to love? And programming as opposed to traditional logic is an arbitrary set of rules…

Q: Yes, but it would be really useful if more people loved it. We could frame it in a way that is exciting for humanities majors. I’m proposing an idea rather than making an airtight argument. “You’re basically right but I don’t really care” (she says laughing :).

Q: I like your idea of teaching it like a writers workshop so that it doesn’t turn into just another course. But I’m not sure that colleges are the best at doing that.

A: not everyone loves programming.

Q: [harry lewis] I take responsibility for eliminating the Harvard requirement for a programming course. Also, take a look at code.org. Third, the academic world treats computer science the way it does because of our disciplinary specialization. That label — computer science — came about in the context of fields like political science, and arose when computers were used not for posting web sites but for putting people on the Moon where a bug could kill someone. The fact that CompSci exists in academic departments will make it very difficult for your vision of computing to exist, just as creative writing is often an uneasy fit into English curricula.

A: That’s very fair. I know it’d be hard. RIT has separate depts for CompSci and coding.

Q: There’s an emergent exploration of coding in Arts schools, with a much more nimble, plug and play approach, very similar to the one you describe. My question: What do the liberal arts have to offer coding? Much of coding is quite new, e.g., open source. These could be understood within a historical context. Maybe these need to be nurtured, explored, broken. Does seeing coding as a liberal art have something to offer sw development?

A: ITP is maybe the best example of artists working with coders. Liberal Arts can teach programmers so much!

Q: Can we celebrate failure? That’d be a crucial part of any coding workshop.

A: Yes! Maybe “find the most interesting bug” and reward introspection about where you’ve gone wrong. But it’s hard in a class like CS50 where you’re evaluating results.

Q: This is known as egoless programming. It’s 40 years old, from Weinberger [no relation].

Q: You’re making a deeper point, which is not just about coding. The important thing is not the knowledge you get, but the way you get there. Being self-reflective about you came about how you learn. You can do this with code but with anything.

A: You’re so right. Introspection about the meta-level of learning is not naturally part of a course. But Ruby is an introspective language: you can ask any object what it is, and it will tell you. This is a great mirror for trying to know yourself better.

Q: What would you pick to teach?

A: I love Ruby. It would be a good choice because there’s a supportive community so students can learn on their own afterwards, and it’s an introspective language. And the lack of ornament in Ruby (no curly braces and little punctuation) makes it much more like English. The logic is much more visible. (My preference is Sinatra, not Rails.)

Q: What sort of time commitment the average person would have to put in to have a basic grasp of a programming language? Adults vs. children learning it?

A: I’d love to see research on this. [Audience: Rottmeyers, CMU (?)] A friend of mine reported he spent 20 hours. The learning curve is very halting at first. It’s hard to teach yourself. It helps to have a supportive in-person environment. CS50 is a 10-20 hour commitment/week and who has that sort of time except for fulltime students? To teach yourself, start out a few hours a time.

Q: How about where the MOOCs are going? Can you do a massively online course in compSci that would capture some of what you’re talking about?

A: The field is so focused on efficiency that MOOCs seem like an obvious idea. I think that a small workshop is the right way to start. CS50 requires so much fear of failure and resilience that it wouldn’t have been a good way for me to start. At CS50, you can’t let others read your code.

Q: We shouldn’t put together Computer Science and programming. Programming is just a layer of expression on top of computer science. You don’t need compSci to become a programmer. And the Net is the new computer; we’re gluing together services from across the Net. That will change how people think about programming because eveyrone will be able to do it. The first language everyone should learn is ifttt.com

Q: I’m a NY Times journalist. I love languages. And I love the analogy you draw. I’m 30. Do you think coding is really essential? Would it open my eyes as a journalist?

A: It’s never too late. If you keep asking the question, you should probably do it. You don’t have to be good at it to get a lot out of it. It’s so cool that your children are learning multiple languages including coding. Learn alongside them.

7 Comments »

Next Page »