Vectoring in Research

35
Vectoring in Research CS 197 | Stanford University | Michael Bernstein

Transcript of Vectoring in Research

Page 1: Vectoring in Research

Vectoring in Research

CS 197 | Stanford University | Michael Bernstein

Page 2: Vectoring in Research

AdministriviaNext week: how to give a talk, by Prof. Kayvon FatahalianTime to dig in to your projects?

2

Page 3: Vectoring in Research

What problem are we solving?

3

“I’m feeling so lost.”

“I thought of an important reason

that this won’t work.”“It’s not working yet. I’m not sure that we’re making progress.”

“But how do we start?”

Page 4: Vectoring in Research

Today’s big idea: vectoringWhat is vectoring?How do we vector effectively?What goes wrong if we don’t vector?

4

Page 5: Vectoring in Research

Bernstein theory of faculty successTo be a Stanford-tier faculty member, you need to master two skills that operate in a tight loop with one another.Vectoring: identifying the biggest dimension of risk in your project right now

Velocity: rapid reduction of risk in the chosen dimension

5

not today!

today

Page 6: Vectoring in Research

What Is Vectoring?

Page 7: Vectoring in Research

What research is not1. Figure out what to do.2. Do it.3. Publish.

7

What research isResearch is an iterative process of

exploration, not a linear path from idea to result [Gowers 2000]

Page 8: Vectoring in Research

Problematic points of view

“OK, we have a good idea. Let’s build it / model it / prove it / get training data.”

8

“I spent some time thinking about this and hacking on it, and it’s not going to work: it has a fatal flaw.”

Treating your research goal as a project spec and executing it

Page 9: Vectoring in Research

Idea as project spec

9

Concept Result

work work work work work work

Taking a concept and trying to realize it in parallel across all decisions, assumptions, and goals

Page 10: Vectoring in Research

Idea as project spec

10

What you didWhat you should have done

This is the endpoint of a research project

This is all other points of a research project

[Buxton 2007]

Page 11: Vectoring in Research

Problematic points of view

“OK, we have a good idea. Let’s build it / model it / prove it / get training data.”

11

“I spent some time thinking about this and hacking on it, and it’s not going to work: it has a fatal flaw.”

…before knowing what to refine!

…before identifying if that test or flaw is the right one to focus on!

….

Page 12: Vectoring in Research

Pick a vectorIt may feel like we get stuck unable to solve the problem because we haven’t figured out everything else about it. There are too many open questions, and too many possible directions. The more dimensions there are, the harder gradient descent becomes.Instead of doing trying to do everything at once (project spec), pick one dimension of uncertainty — one vector — and focus on reducing its risk and uncertainty.

12

Page 13: Vectoring in Research
Page 14: Vectoring in Research

Example vectorsPiloting: will this technique work at all? To answer this, we implement a basic version of the technique and mock in the data and other test harness elements.Engineering: will this technique work with a realistic workload? To answer this, we need to engineer a test harness.Proving: does the limit exist that I suspect does? To answer this, we start by writing a proof for a simpler case.Design: what might this interaction look like to an end user? To answer this, we create a low-fi prototype.

14

Page 15: Vectoring in Research

ImplicationsThe vectors under consideration will each imply building different parts of your system. Rather than building them all at once, when you might have to change things later, vectoring instead implies that you start by reducing uncertainty in the most important dimension first — your “inner loop” — and then building out from there.

15

Page 16: Vectoring in Research

Vectoring algorithm

16

1. Generate questions Untested hunches, risky decisions,high-level directions

2. Rank your questions Which is most critical?

3. Pick one and answer it rapidly Answer only the most critical question(This is where velocity comes into play)

Page 17: Vectoring in Research

Assumption mappingAssumption mapping is a strategy for articulating questions and ranking them.

17

Unknown

Unimportant

Important

Known

Try assumption mapping your project [5min]

Page 18: Vectoring in Research

Let’s Try It

Page 19: Vectoring in Research

TrollingWhile everyone thinks that trolling online is due to a small number of antisocial sociopaths, we had a hunch that “normal” people were responsible for much trolling behavior when triggered.What’s our first step?We have: dataset of 16M CNN comments (w/ troll flags), Mechanical Turk for studies 19

Page 20: Vectoring in Research

TrollingPossible vectors:

Do people really troll when pissed off?Can we train a classifier to predict when someone would troll, and compare weights of personal history vs. other posts and title?Does the same person troll more on certain (angry) topics than on other (boring) ones? 20

Page 21: Vectoring in Research

TeamingWe wanted to create an algorithm that would weave collaboration networks to help spread ideas over time by moving people from team to team.What’s our first step?

21

Page 22: Vectoring in Research

TeamingPossible vectors:

Do new members with new perspectives actually exert influence in practice?If we prioritize or de-prioritize membership rotation in a simple (greedy) algorithm, does it lead to different outcomes in the collaboration network?

22

Page 23: Vectoring in Research

LearningWe thought that, in domains where ML still cannot succeed, we could draw on crowdsourcing to identify human-labeled predictive features. In other words, that people are great at identifying potentially informative features, but might be poor at weighing those features correctly to arrive at a prediction.What’s our first step?

23

Page 24: Vectoring in Research

LearningPossible vectors:

Can people identify predictive features for a single domain, e.g., lie detection?Can people estimate which features are going to be informative?Would a hybrid classifier (human features and labels as input to an ML model) actually perform well?

24

Page 25: Vectoring in Research

Why is vectoring so important?

Page 26: Vectoring in Research

“ I f Er nes t Hemingway, J ames Mitchener, Neil Simon, Frank Lloyd Wright, and Pablo Picasso could not get it right the first time, what makes you think that you will?”

— Paul Heckel

Page 27: Vectoring in Research

Iteration >> planningIdeas rarely land exactly where you expect they will. It’s best to test the most critical assumptions quickly, so that you can understand whether your hunch will play out, and what problems are worth spending time solving vs. kludging.Human creative work is best in a loop of reflection and iteration. Vectoring is a way to make sure you’re getting the most iteration cycles.

27

Page 28: Vectoring in Research

Re-vectoringOften, after vectoring and reducing uncertainty in one dimension, it raises new questions and uncertainties.In the next round of vectoring, you re-prioritize:

If you get unexpected results and are confused (most of the time!), maybe it means you take a new angle to reduce uncertainty on a vector related to the prior one.If you answer your question to your own satisfaction (not completely, just to your satisfaction), you move on to the next most important vector

28

Page 29: Vectoring in Research

Magnitude of your vectorThe result of vectoring should be something achievable in about a week’s sprint. If it’s not, you’ve picked too broad a question to answer.

If your vectoring for “Can normal people be responsible for a lot of the trolling online?” is “Can normal people be responsible for a lot of the trolling on CNN.com?”, you’re still way too broad.That’s evidence that you’ve just rescaled your project, not picked a vector.

29

Page 30: Vectoring in Research

Takeaways, in brief

Page 31: Vectoring in Research

1) The temptation is to try and solve the problem that’s set in front of you. Don’t.

Page 32: Vectoring in Research

2) Vectoring is a process of identifying the dimension of highest impact+uncertainty, and prioritizing that dimension while scaffolding the others

Page 33: Vectoring in Research

3) Successful vectoring enables you to rapidly hone in on the core insight of your research project

Page 34: Vectoring in Research

Assignment 4At this point, your project transitions to a state where your team is working to try and achieve the goal you set out in Assignment 3.Each week for the next several weeks, your team will perform vectoring, submit a brief summary and slide, and report in section:

This week’s vectorThis week’s planThis week’s resultNext week’s vectorNext week’s plan 34

Page 35: Vectoring in Research

Slide content shareable under a Creative Commons Attribution-NonCommercial 4.0 International License.

35

Vectoring in Research