The Django Book User Manual

User Manual: Pdf

Open the PDF directly: View PDF .
Page Count: 424 [warning: Documents this large are best viewed by clicking the View PDF Link!]

About this book
- Acknowledgements
- About the Authors
- About the Technical Reviewer
Introduction
Chapter 1: Introduction to Django
- What Is a Web Framework?
- The MVC Design Pattern
- Django's History
- How to Read This Book
Chapter 2: Getting Started
- Installing Python
- Installing Django
- Testing the Django installation
- Setting Up a Database
- Starting a Project
- What's Next?
Chapter 3: Views and URLconfs
- Your First Django-Powered Page: Hello World
- How Django Processes a Request
- Your Second View: Dynamic Content
- URLconfs and Loose Coupling
- Your Third View: Dynamic URLs
- Django's Pretty Error Pages
- What's next?
Chapter 4: Templates
- Template System Basics
- Using the Template System
- Basic Template Tags and Filters
- Philosophies and Limitations
- Using Templates in Views
- Template Loading
- Template Inheritance
- What's next?
Chapter 5: Models
- The ``Dumb'' Way to Do Database Queries in Views
- The MTV (or MVC) Development Pattern
- Configuring the Database
- Your First App
- Defining Models in Python
- Your First Model
- Installing the Model
- Basic Data Access
- Adding Model String Representations
- Inserting and Updating Data
- Selecting Objects
- Deleting Objects
- What's Next?
Chapter 6: The Django Admin Site
- The django.contrib packages
- Activating the Admin Interface
- Using the Admin Site
- Adding Your Models to the Admin Site
- How the Admin Site Works
- Making Fields Optional
- Customizing Field Labels
- Custom ModelAdmin classes
- Users, Groups, and Permissions
- When and Why to Use the Admin Interface – And When Not to
- What's Next?
Chapter 7: Forms
- Getting Data From the Request Object
- A Simple Form-Handling Example
- Improving Our Simple Form-Handling Example
- Simple validation
- Making a Contact Form
- Your First Form Class
- Tying Form Objects Into Views
- Changing How Fields Are Rendered
- Setting a Maximum Length
- Setting Initial Values
- Custom Validation Rules
- Specifying labels
- Customizing Form Design
- What's Next?
Chapter 8: Advanced Views and URLconfs
- URLconf Tricks
- Including Other URLconfs
- What's Next?
Chapter 9: Advanced Templates
- Template Language Review
- RequestContext and Context Processors
- Automatic HTML Escaping
- Inside Template Loading
- Extending the Template System
- Writing Custom Template Loaders
- Configuring the Template System in Standalone Mode
- What's Next
Chapter 10: Advanced Models
- Related Objects
- Making Changes to a Database Schema
- Managers
- Model methods
- Executing Raw SQL Queries
- What's Next?
Chapter 11: Generic Views
- Using Generic Views
- Generic Views of Objects
- Extending Generic Views
- What's Next?
Chapter 12: Deploying Django
- Preparing Your Codebase for Production
- Using Different Settings for Production
- DJANGO_SETTINGS_MODULE
- Using Django with Apache and mod_python
- Using Django with FastCGI
- Scaling
- Performance Tuning
- What's Next?
Chapter 13: Generating Non-HTML Content
- The basics: views and MIME-types
- Producing CSV
- Generating PDFs
- Other Possibilities
- The Syndication Feed Framework
- The Sitemap Framework
- What's Next?
Chapter 14: Sessions, Users, and Registration
- Cookies
- Django's Session Framework
- Users and Authentication
- Permissions, Groups and Messages
- What's Next
Chapter 15: Caching
- Setting Up the Cache
- The Per-Site Cache
- The Per-View Cache
- Template Fragment Caching
- The Low-Level Cache API
- Upstream Caches
- Using Vary Headers
- Controlling Cache: Using Other Headers
- Other Optimizations
- Order of MIDDLEWARE_CLASSES
- What's Next?
Chapter 16: django.contrib
- The Django Standard Library
- Sites
- Flatpages
- Redirects
- CSRF Protection
- Humanizing Data
- Markup Filters
- What's Next?
Chapter 17: Middleware
- What's Middleware?
- Middleware Installation
- Middleware Methods
- Built-in Middleware
- What's Next?
Chapter 18: Integrating with Legacy Databases and Applications
- Integrating with a Legacy Database
- Integrating with an Authentication System
- Integrating with Legacy Web Applications
- What's Next?
Chapter 19: Internationalization
- 1. How to Specify Translation Strings
- 2. How to Create Language Files
- 3. How Django Discovers Language Preference
- Using Translations in Your Own Projects
- The set_language Redirect View
- Translations and JavaScript
- Notes for Users Familiar with gettext
- gettext on Windows
- What's Next?
Chapter 20: Security
- The Theme of Web Security
- SQL Injection
- Cross-Site Scripting (XSS)
- Cross-Site Request Forgery
- Session Forging/Hijacking
- E-mail Header Injection
- Directory Traversal
- Exposed Error Messages
- A Final Word on Security
- What's Next?
Appendix A: Model Definition Reference
- Fields
- Universal Field Options
- Relationships
- Model Metadata Options
Appendix B: Database API Reference
- Creating Objects
- Saving Changes to Objects
- Retrieving Objects
- Caching and QuerySets
- Filtering Objects
- Field Lookups
- Complex Lookups with Q Objects
- Related Objects
- Deleting Objects
- Shortcuts
- Falling Back to Raw SQL
Appendix C: Generic View Reference
- Common Arguments to Generic Views
- ``Simple'' Generic Views
- List/Detail Generic Views
- Date-Based Generic Views
Appendix D: Settings
- What's a Settings File?
- Designating the Settings: DJANGO_SETTINGS_MODULE
- Using Settings Without Setting DJANGO_SETTINGS_MODULE
- Available Settings
Appendix E: Built-in Template Tags and Filters
- Built-in Tag Reference
- Built-in Filter Reference
Appendix F: The django-admin Utility
- Usage
- Available subcommands
- Default options
- Extra niceties
Appendix G: Request and Response Objects
- HttpRequest
- HttpResponse
License & Copyright
- GNU Free Documentation License

The Django Book

Release 2.0

Adrian Holovaty, Jacob Kaplan-Moss, et al.

May 08, 2014

Contents

1 About this book 3

1.1 Acknowledgements ........................................... 3

1.2 About the Authors ............................................ 4

1.3 About the Technical Reviewer ...................................... 4

2 Introduction 5

3 Chapter 1: Introduction to Django 7

3.1 What Is a Web Framework? ....................................... 7

3.2 The MVC Design Pattern ........................................ 8

3.3 Django’s History ............................................. 10

3.4 How to Read This Book ......................................... 11

4 Chapter 2: Getting Started 13

4.1 Installing Python ............................................. 13

4.2 Installing Django ............................................. 14

4.3 Testing the Django installation ...................................... 16

4.4 Setting Up a Database .......................................... 16

4.5 Starting a Project ............................................. 18

4.6 What’s Next? ............................................... 20

5 Chapter 3: Views and URLconfs 21

5.1 Your First Django-Powered Page: Hello World ............................. 21

5.2 How Django Processes a Request .................................... 27

5.3 Your Second View: Dynamic Content .................................. 27

5.4 URLconfs and Loose Coupling ..................................... 29

5.5 Your Third View: Dynamic URLs .................................... 30

5.6 Django’s Pretty Error Pages ....................................... 33

5.7 What’s next? ............................................... 34

6 Chapter 4: Templates 35

6.1 Template System Basics ......................................... 35

6.2 Using the Template System ....................................... 36

6.3 Basic Template Tags and Filters ..................................... 43

6.4 Philosophies and Limitations ...................................... 49

6.5 Using Templates in Views ........................................ 50

6.6 Template Loading ............................................ 51

6.7 Template Inheritance ........................................... 55

6.8 What’s next? ............................................... 58

7 Chapter 5: Models 61

7.1 The “Dumb” Way to Do Database Queries in Views .......................... 61

7.2 The MTV (or MVC) Development Pattern ............................... 62

7.3 Conﬁguring the Database ........................................ 63

7.4 Your First App .............................................. 65

7.5 Deﬁning Models in Python ........................................ 66

7.6 Your First Model ............................................. 67

7.7 Installing the Model ........................................... 68

7.8 Basic Data Access ............................................ 71

7.9 Adding Model String Representations .................................. 72

7.10 Inserting and Updating Data ....................................... 73

7.11 Selecting Objects ............................................. 74

7.12 Deleting Objects ............................................. 79

7.13 What’s Next? ............................................... 80

8 Chapter 6: The Django Admin Site 81

8.1 The django.contrib packages ....................................... 81

8.2 Activating the Admin Interface ..................................... 81

8.3 Using the Admin Site .......................................... 82

8.4 Adding Your Models to the Admin Site ................................. 89

8.5 How the Admin Site Works ....................................... 89

8.6 Making Fields Optional ......................................... 90

8.7 Customizing Field Labels ........................................ 92

8.8 Custom ModelAdmin classes ...................................... 92

8.9 Users, Groups, and Permissions ..................................... 102

8.10 When and Why to Use the Admin Interface – And When Not to .................... 102

8.11 What’s Next? ............................................... 103

9 Chapter 7: Forms 105

9.1 Getting Data From the Request Object ................................. 105

9.2 A Simple Form-Handling Example ................................... 107

9.3 Improving Our Simple Form-Handling Example ............................ 110

9.4 Simple validation ............................................. 111

9.5 Making a Contact Form ......................................... 113

9.6 Your First Form Class .......................................... 116

9.7 Tying Form Objects Into Views ..................................... 118

9.8 Changing How Fields Are Rendered ................................... 119

9.9 Setting a Maximum Length ....................................... 120

9.10 Setting Initial Values ........................................... 120

9.11 Custom Validation Rules ......................................... 120

9.12 Specifying labels ............................................. 121

9.13 Customizing Form Design ........................................ 121

9.14 What’s Next? ............................................... 123

10 Chapter 8: Advanced Views and URLconfs 125

10.1 URLconf Tricks ............................................. 125

10.2 Including Other URLconfs ........................................ 139

10.3 What’s Next? ............................................... 142

11 Chapter 9: Advanced Templates 143

11.1 Template Language Review ....................................... 143

11.2 RequestContext and Context Processors ................................. 144

11.3 Automatic HTML Escaping ....................................... 148

11.4 Inside Template Loading ......................................... 150

11.5 Extending the Template System ..................................... 151

11.6 Writing Custom Template Loaders ................................... 160

11.7 Conﬁguring the Template System in Standalone Mode ......................... 161

11.8 What’s Next ............................................... 161

12 Chapter 10: Advanced Models 163

12.1 Related Objects .............................................. 163

12.2 Making Changes to a Database Schema ................................. 165

12.3 Managers ................................................. 167

12.4 Model methods .............................................. 170

12.5 Executing Raw SQL Queries ....................................... 171

12.6 What’s Next? ............................................... 171

13 Chapter 11: Generic Views 173

13.1 Using Generic Views ........................................... 173

13.2 Generic Views of Objects ........................................ 174

13.3 Extending Generic Views ........................................ 176

13.4 What’s Next? ............................................... 180

14 Chapter 12: Deploying Django 181

14.1 Preparing Your Codebase for Production ................................ 181

14.2 Using Different Settings for Production ................................. 183

14.3 DJANGO_SETTINGS_MODULE ................................... 185

14.4 Using Django with Apache and mod_python .............................. 185

14.5 Using Django with FastCGI ....................................... 188

14.6 Scaling .................................................. 193

14.7 Performance Tuning ........................................... 198

14.8 What’s Next? ............................................... 199

15 Chapter 13: Generating Non-HTML Content 201

15.1 The basics: views and MIME-types ................................... 201

15.2 Producing CSV .............................................. 202

15.3 Generating PDFs ............................................. 203

15.4 Other Possibilities ............................................ 205

15.5 The Syndication Feed Framework .................................... 205

15.6 The Sitemap Framework ......................................... 211

15.7 What’s Next? ............................................... 215

16 Chapter 14: Sessions, Users, and Registration 217

16.1 Cookies .................................................. 217

16.2 Django’s Session Framework ...................................... 219

16.3 Users and Authentication ........................................ 224

16.4 Permissions, Groups and Messages ................................... 232

16.5 What’s Next ............................................... 234

17 Chapter 15: Caching 235

17.1 Setting Up the Cache ........................................... 235

17.2 The Per-Site Cache ............................................ 238

17.3 The Per-View Cache ........................................... 239

17.4 Template Fragment Caching ....................................... 240

17.5 The Low-Level Cache API ........................................ 241

17.6 Upstream Caches ............................................. 242

17.7 Using Vary Headers ........................................... 243

17.8 Controlling Cache: Using Other Headers ................................ 244

17.9 Other Optimizations ........................................... 245

17.10 Order of MIDDLEWARE_CLASSES .................................. 245

iii

17.11 What’s Next? ............................................... 246

18 Chapter 16: django.contrib 247

18.1 The Django Standard Library ...................................... 247

18.2 Sites ................................................... 248

18.3 Flatpages ................................................. 253

18.4 Redirects ................................................. 256

18.5 CSRF Protection ............................................. 257

18.6 Humanizing Data ............................................. 259

18.7 Markup Filters .............................................. 260

18.8 What’s Next? ............................................... 260

19 Chapter 17: Middleware 263

19.1 What’s Middleware? ........................................... 263

19.2 Middleware Installation ......................................... 264

19.3 Middleware Methods ........................................... 264

19.4 Built-in Middleware ........................................... 266

19.5 What’s Next? ............................................... 268

20 Chapter 18: Integrating with Legacy Databases and Applications 269

20.1 Integrating with a Legacy Database ................................... 269

20.2 Integrating with an Authentication System ............................... 271

20.3 Integrating with Legacy Web Applications ............................... 272

20.4 What’s Next? ............................................... 273

21 Chapter 19: Internationalization 275

21.1 1. How to Specify Translation Strings .................................. 276

21.2 2. How to Create Language Files .................................... 280

21.3 3. How Django Discovers Language Preference ............................. 282

21.4 Using Translations in Your Own Projects ................................ 284

21.5 The set_language Redirect View .................................. 285

21.6 Translations and JavaScript ....................................... 286

21.7 Notes for Users Familiar with gettext ................................ 287

21.8 gettext on Windows ......................................... 288

21.9 What’s Next? ............................................... 288

22 Chapter 20: Security 289

22.1 The Theme of Web Security ....................................... 289

22.2 SQL Injection .............................................. 289

22.3 Cross-Site Scripting (XSS) ........................................ 291

22.4 Cross-Site Request Forgery ....................................... 292

22.5 Session Forging/Hijacking ........................................ 292

22.6 E-mail Header Injection ......................................... 294

22.7 Directory Traversal ............................................ 294

22.8 Exposed Error Messages ......................................... 295

22.9 A Final Word on Security ........................................ 296

22.10 What’s Next? ............................................... 296

23 Appendix A: Model Deﬁnition Reference 297

23.1 Fields ................................................... 297

23.2 Universal Field Options ......................................... 302

23.3 Relationships ............................................... 305

23.4 Model Metadata Options ......................................... 307

24 Appendix B: Database API Reference 311

24.1 Creating Objects ............................................. 311

24.2 Saving Changes to Objects ........................................ 313

24.3 Retrieving Objects ............................................ 314

24.4 Caching and QuerySets ......................................... 314

24.5 Filtering Objects ............................................. 315

24.6 Field Lookups .............................................. 322

24.7 Complex Lookups with Q Objects .................................... 325

24.8 Related Objects .............................................. 326

24.9 Deleting Objects ............................................. 330

24.10 Shortcuts ................................................. 330

24.11 Falling Back to Raw SQL ........................................ 331

25 Appendix C: Generic View Reference 333

25.1 Common Arguments to Generic Views ................................. 333

25.2 “Simple” Generic Views ......................................... 334

25.3 List/Detail Generic Views ........................................ 335

25.4 Date-Based Generic Views ........................................ 338

26 Appendix D: Settings 349

26.1 What’s a Settings File? .......................................... 349

26.2 Designating the Settings: DJANGO_SETTINGS_MODULE ...................... 350

26.3 Using Settings Without Setting DJANGO_SETTINGS_MODULE ................... 351

26.4 Available Settings ............................................ 353

27 Appendix E: Built-in Template Tags and Filters 365

27.1 Built-in Tag Reference .......................................... 365

27.2 Built-in Filter Reference ......................................... 376

28 Appendix F: The django-admin Utility 389

28.1 Usage ................................................... 389

28.2 Available subcommands ......................................... 390

28.3 Default options .............................................. 398

28.4 Extra niceties ............................................... 399

29 Appendix G: Request and Response Objects 401

29.1 HttpRequest ............................................... 401

29.2 HttpResponse ............................................... 406

30 License & Copyright 411

30.1 GNU Free Documentation License ................................... 411

The Django Book, Release 2.0

Welcome to the online version of The Django Book, a free book about the Django Web framework for the Python

programming language.

A warning about this edition

The community edition of The Django Book is in transition. While the book mentions Django version 1.4 in places,

the vast majority of the book is for Django version 1.0, which was released over four years ago. Therefore this resource

is extremely out of date and, until the book is ﬁnished being updated, we ask that, at this time, djangobook.com not

be used for educational purposes.

This book was originally published by Apress in 2009 and covered Django version 1.0. Since then it has languished

and, in places, is extremely out of date. We are working on getting the book updated to cover Django versions 1.4,

1.5, and beyond. But we need your help, we’re making this book — warts and all — open source in the hopes that it

will ﬁnd love as a community project.

So, if you’d like to help out, hop on over to GitHub and send us a pull request!

Contents 1

The Django Book, Release 2.0

2 Contents

CHAPTER 1

About this book

You’re reading The Django Book, ﬁrst published in December 2007 (and updated in 2009) by Apress as The Deﬁnitive

Guide to Django: Web Development Done Right.

We’ve released this book freely for a couple of reasons. First, we love Django and we want it to be as accessible as

possible. Many programmers learn their craft from well-written technical material, so we set out to create a top-notch

guide and reference to Django.

Second, it turns out that writing books about technology is fundamentally difﬁcult: your words are often outdated

before the book even reaches the printer. On the web, however, “the ink is never dry” — we can (and will!) keep the

book updated.

1.1 Acknowledgements

The most gratifying aspect of working on Django is the community. We’ve been especially lucky that Django has

attracted such a smart, motivated, and friendly bunch. A segment of that community followed us over to the online

“beta” release of this book. Their reviews and comments were indispensable; this book wouldn’t have been possible

without all that wonderful peer review. Almost a thousand people left comments that helped us improve the clarity,

quality, and ﬂow of the ﬁnal book; we’d like to thank each and every one of them.

We’re especially grateful to those who took the time to review the book in depth and left dozens (sometimes hundreds)

of comments apiece: Marty Alchin, Max Battcher, Oliver Beat- tie, Rod Begbie, Paul Bissex, Matt Boersma, Robbin

Bonthond, Peter Bowyer, Nesta Campbell, Jon Colverson, Jeff Croft, Chris Dary, Alex Dong, Matt Drew, Robert

Dzikowski, Nick Efford, Ludvig Ericson, Eric Floehr, Brad Fults, David Grant, Simon Greenhill, Robert Have-

man, Kent Johnson, Andrew Kember, Marek Kubica, Eduard Kucera, Anand Kumria, Scott Lamb, Fredrik Lundh,

Vadim Macagon, Markus Majer, Orestis Markou, R. Mason, Yasushi Masuda, Kevin Menard, Carlo Miron, James

Mulholland, R.D. Nielsen, Michael O’Keefe, Lawrence Oluyede, Andreas Pfrengle, Frankie Robertson, Mike Robin-

son, Armin Ronacher, Daniel Roseman, Johan Samyn, Ross Shannon, Carolina F. Silva, Paul Smith, Björn Stabell,

Bob Stepno, Graeme Stevenson, Justin Stockton, Kevin Teague, Daniel Tietze, Brooks Travis, Peter Tripp, Matthias

Urlichs, Peter van Kampen, Alexandre Vassalotti, Jay Wang, Brian Will, and Joshua Works.

Many thanks to our technical editor, Jeremy Dunck. Without Jeremy this book would be littered with errors, inaccu-

racies, and broken code. We feel very lucky that someone as talented as Jeremy found the time to help us out.

Specials thanks go to Simon Willison for writing the chapter on form processing. We really appreciate the help, and

we’re thrilled that Simon’s excellent writing can be part of this book.

We’re grateful for all the hard work the folks at Apress put into this book. They’ve been amazingly supportive and

patient; this book wouldn’t have come together without a lot of work on their part. We’re especially happy that Apress

supported and even encouraged the free release of this book online; it’s wonderful seeing a publisher so embracing the

spirit of open source.

The Django Book, Release 2.0

Finally, of course, thanks to our friends, families, and coworkers who’ve graciously tolerated our mental absence while

we ﬁnished this work.

1.2 About the Authors

Adrian Holovaty is a Web developer with a background in journalism. He’s known in journalism circles as one of the

pioneers of “journalism via computer programming”, and in technical circles as “the guy who invented Django.”

He was lead developer at World Online for 2.5 years, during which time Django was developed and implemented on

World Online’s sites. He’s the founder of EveryBlock, a “news feed for your block”.

Adrian lives in Chicago, USA.

Jacob Kaplan-Moss is a partner at Revolution Systems which provides support services around Django and related

open source technologies. Jacob previously worked at World Online, where Django was invented, where he was the

lead developer of Ellington, a commercial Web publishing platform for media companies.

Jacob lives in Lawrence, Kansas, USA.

1.3 About the Technical Reviewer

Jeremy Dunck was rescued from corporate IT drudgery by Free Software and, in part, Django. Many of Jeremy’s

interests center around access to information.

Jeremy was the lead developer of Pegasus News, one of the ﬁrst uses of Django outside World Online, and has since

joined Votizen, a startup intent on reducing the inﬂuence of money in politics.

He serves as DSF Secretary, organizes and helps organize sprints, cares about the health and equity of the Django

community. He has gone an embarrassingly long time without a working blog.

Jeremy lives in Mountain View, CA, USA.

4 Chapter 1. About this book

CHAPTER 2

Introduction

In the early days, Web developers wrote every page by hand. Updating a Web site meant editing HTML; a “redesign”

involved redoing every single page, one at a time.

As Web sites grew and became more ambitious, it quickly became obvious that that situation was tedious, time-

consuming, and ultimately untenable. A group of enterprising hackers at NCSA (the National Center for Supercom-

puting Applications, where Mosaic, the ﬁrst graphical Web browser, was developed) solved this problem by letting the

Web server spawn external programs that could dynamically generate HTML. They called this protocol the Common

Gateway Interface, or CGI, and it changed the Web forever.

It’s hard now to imagine what a revelation CGI must have been: instead of treating HTML pages as simple ﬁles on

disk, CGI allows you to think of your pages as resources generated dynamically on demand. The development of CGI

ushered in the ﬁrst generation of dynamic Web sites.

However, CGI has its problems: CGI scripts need to contain a lot of repetitive “boilerplate” code, they make code

reuse difﬁcult, and they can be difﬁcult for ﬁrst-time developers to write and understand.

PHP ﬁxed many of these problems, and it took the world by storm – it’s now by far the most popular tool used to

create dynamic Web sites, and dozens of similar languages and environments (ASP, JSP, etc.) followed PHP’s design

closely. PHP’s major innovation is its ease of use: PHP code is simply embedded into plain HTML; the learning curve

for someone who already knows HTML is extremely shallow.

But PHP has its own problems; its very ease of use encourages sloppy, repetitive, ill-conceived code. Worse, PHP does

little to protect programmers from security vulnerabilities, and thus many PHP developers found themselves learning

about security only once it was too late.

These and similar frustrations led directly to the development of the current crop of “third-generation” Web devel-

opment frameworks. These frameworks – Django and Ruby on Rails appear to be the most popular these days –

recognize that the Web’s importance has escalated of late. With this new explosion of Web development comes yet

another increase in ambition; Web developers are expected to do more and more every day.

Django was invented to meet these new ambitions. Django lets you build deep, dynamic, interesting sites in an

extremely short time. Django is designed to let you focus on the fun, interesting parts of your job while easing the pain

of the repetitive bits. In doing so, it provides high-level abstractions of common Web development patterns, shortcuts

for frequent programming tasks, and clear conventions on how to solve problems. At the same time, Django tries to

stay out of your way, letting you work outside the scope of the framework as needed. We wrote this book because

we ﬁrmly believe that Django makes Web development better. It’s designed to quickly get you moving on your own

Django projects, and then ultimately teach you everything you need to know to successfully design, develop, and

deploy a site that you’ll be proud of.

We’re extremely interested in your feedback. This book is open source and all are welcome to improve it. If you

prefer to suggest changes, please drop us a line at feedback@djangobook.com. Either way, we’d love to hear from

you! We’re glad you’re here, and we hope that you ﬁnd Django as exciting, fun and useful as we do.

The Django Book, Release 2.0

6 Chapter 2. Introduction

CHAPTER 3

Chapter 1: Introduction to Django

This book is about Django, a Web development framework that saves you time and makes Web development a joy.

Using Django, you can build and maintain high-quality Web applications with minimal fuss.

At its best, Web development is an exciting, creative act; at its worst, it can be a repetitive, frustrating nuisance.

Django lets you focus on the fun stuff – the crux of your Web application – while easing the pain of the repetitive

bits. In doing so, it provides high-level abstractions of common Web development patterns, shortcuts for frequent

programming tasks, and clear conventions for how to solve problems. At the same time, Django tries to stay out of

your way, letting you work outside the scope of the framework as needed.

The goal of this book is to make you a Django expert. The focus is twofold. First, we explain, in depth, what

Django does and how to build Web applications with it. Second, we discuss higher-level concepts where appropriate,

answering the question “How can I apply these tools effectively in my own projects?” By reading this book, you’ll

learn the skills needed to develop powerful Web sites quickly, with code that is clean and easy to maintain.

3.1 What Is a Web Framework?

Django is a prominent member of a new generation of Web frameworks – but what does that term mean, precisely?

To answer that question, let’s consider the design of a Web application written in Python without a framework.

Throughout this book, we’ll take this approach of showing you basic ways of getting work done without shortcuts, in

the hope that you’ll recognize why shortcuts are so helpful. (It’s also valuable to know how to get things done without

shortcuts because shortcuts aren’t always available. And most importantly, knowing why things work the way they do

makes you a better Web developer.)

One of the simplest, most direct ways to build a Python Web app from scratch is to use the Common Gateway Interface

(CGI) standard, which was a popular technique circa 1998. Here’s a high-level explanation of how it works: just create

a Python script that outputs HTML, then save the script to a Web server with a ”.cgi” extension and visit the page in

your Web browser. That’s it.

Here’s an example Python CGI script that displays the ten most recently published books from a database. Don’t

worry about syntax details; just get a feel for the basic things it’s doing:

#!/usr/bin/env python

import MySQLdb

print "Content-Type: text/html\n"

print "<html><head><title>Books</title></head>"

print "<body>"

print "<h1>Books</h1>"

print "<ul>"

The Django Book, Release 2.0

connection =MySQLdb.connect(user=’me’, passwd=’letmein’, db=’my_db’)

cursor =connection.cursor()

cursor.execute("SELECT name FROM books ORDER BY pub_date DESC LIMIT 10")

for row in cursor.fetchall():

print "<li>%s</li>" %row[0]

print "</ul>"

print "</body></html>"

connection.close()

First, to fulﬁll the requirements of CGI, this code prints a “Content-Type” line, followed by a blank line. It prints some

introductory HTML, connects to a database and runs a query to retrieve the names of the latest ten books. Looping

over those books, it generates an HTML list of the titles. Finally, it prints the closing HTML and closes the database

connection.

With a one-off page like this one, the write-it-from-scratch approach isn’t necessarily bad. For one thing, this code is

simple to comprehend – even a novice developer can read these 16 lines of Python and understand everything it does,

from start to ﬁnish. There’s nothing else to learn, no other code to read. It’s also simple to deploy: just save this code

in a ﬁle that ends with ”.cgi”, upload that ﬁle to a Web server, and visit that page with a browser.

But despite its simplicity, this approach has a number of problems and annoyances. Ask yourself these questions:

• What happens when multiple parts of your application need to connect to the database? Surely that database-

connecting code shouldn’t need to be duplicated in each individual CGI script. The pragmatic thing to do would

be to refactor it into a shared function.

• Should a developer really have to worry about printing the “Content-Type” line and remembering to close the

database connection? This sort of boilerplate reduces programmer productivity and introduces opportunities for

mistakes. These setup- and teardown-related tasks would best be handled by some common infrastructure.

• What happens when this code is reused in multiple environments, each with a separate database and password?

At this point, some environment-speciﬁc conﬁguration becomes essential.

• What happens when a Web designer who has no experience coding Python wishes to redesign the page? One

wrong character could crash the entire application. Ideally, the logic of the page – the retrieval of book titles

from the database – would be separate from the HTML display of the page, so that a designer could edit the

latter without affecting the former.

These problems are precisely what a Web framework intends to solve. A Web framework provides a programming

infrastructure for your applications, so that you can focus on writing clean, maintainable code without having to

reinvent the wheel. In a nutshell, that’s what Django does.

3.2 The MVC Design Pattern

Let’s dive in with a quick example that demonstrates the difference between the previous approach and a Web

framework’s approach. Here’s how you might write the previous CGI code using Django. The ﬁrst thing to

note is that that we split it over four Python ﬁles (models.py,views.py,urls.py) and an HTML template

(latest_books.html):

# models.py (the database tables)

from django.db import models

class Book(models.Model):

name = models.CharField(max_length=50)

8 Chapter 3. Chapter 1: Introduction to Django

The Django Book, Release 2.0

pub_date = models.DateField()

# views.py (the business logic)

from django.shortcuts import render

from models import Book

def latest_books(request):

book_list = Book.objects.order_by(’-pub_date’)[:10]

return render(request, ’latest_books.html’, {’book_list’: book_list})

# urls.py (the URL configuration)

from django.conf.urls.defaults import *

import views

urlpatterns = patterns(’’,

(r’^latest/$’, views.latest_books),

)

# latest_books.html (the template)

<html><head><title>Books</title></head>

<body>

<h1>Books</h1>

<ul>

{% for book in book_list %}

{% endfor %}

</ul>

</body></html>

Again, don’t worry about the particulars of syntax; just get a feel for the overall design. The main thing to note here is

the separation of concerns:

• The models.py ﬁle contains a description of the database table, represented by a Python class. This class

is called a model. Using it, you can create, retrieve, update and delete records in your database using simple

Python code rather than writing repetitive SQL statements.

• The views.py ﬁle contains the business logic for the page. The latest_books() function is called a view.

• The urls.py ﬁle speciﬁes which view is called for a given URL pattern. In this case, the URL /latest/

will be handled by the latest_books() function. In other words, if your domain is example.com, any visit

to the URL http://example.com/latest/ will call the latest_books() function.

• The latest_books.html ﬁle is an HTML template that describes the design of the page. It uses a template

language with basic logic statements – e.g., {% for book in book_list %}.

Taken together, these pieces loosely follow a pattern called Model-View-Controller (MVC). Simply put, MVC is way

of developing software so that the code for deﬁning and accessing data (the model) is separate from request-routing

logic (the controller), which in turn is separate from the user interface (the view). (We’ll discuss MVC in more depth

in Chapter 5.)

A key advantage of such an approach is that components are loosely coupled. Each distinct piece of a Django-powered

Web application has a single key purpose and can be changed independently without affecting the other pieces. For

example, a developer can change the URL for a given part of the application without affecting the underlying imple-

3.2. The MVC Design Pattern 9

The Django Book, Release 2.0

mentation. A designer can change a page’s HTML without having to touch the Python code that renders it. A database

administrator can rename a database table and specify the change in a single place, rather than having to search and

replace through a dozen ﬁles.

In this book, each component of MVC gets its own chapter. Chapter 3 covers views, Chapter 4 covers templates, and

Chapter 5 covers models.

3.3 Django’s History

Before we dive into more code, we should take a moment to explain Django’s history. We noted above that we’ll be

showing you how to do things without shortcuts so that you more fully understand the shortcuts. Similarly, it’s useful

to understand why Django was created, because knowledge of the history will put into context why Django works the

way it does.

If you’ve been building Web applications for a while, you’re probably familiar with the problems in the CGI example

we presented earlier. The classic Web developer’s path goes something like this:

1. Write a Web application from scratch.

2. Write another Web application from scratch.

3. Realize the application from step 1 shares much in common with the application from step 2.

4. Refactor the code so that application 1 shares code with application 2.

5. Repeat steps 2-4 several times.

6. Realize you’ve invented a framework.

This is precisely how Django itself was created!

Django grew organically from real-world applications written by a Web development team in Lawrence, Kansas,

USA. It was born in the fall of 2003, when the Web programmers at the Lawrence Journal-World newspaper, Adrian

Holovaty and Simon Willison, began using Python to build applications.

The World Online team, responsible for the production and maintenance of several local news sites, thrived in a

development environment dictated by journalism deadlines. For the sites – including LJWorld.com, Lawrence.com

and KUsports.com – journalists (and management) demanded that features be added and entire applications be built

on an intensely fast schedule, often with only days’ or hours’ notice. Thus, Simon and Adrian developed a time-saving

Web development framework out of necessity – it was the only way they could build maintainable applications under

the extreme deadlines.

In summer 2005, after having developed this framework to a point where it was efﬁciently powering most of World

Online’s sites, the team, which now included Jacob Kaplan-Moss, decided to release the framework as open source

software. They released it in July 2005 and named it Django, after the jazz guitarist Django Reinhardt.

Now, several years later, Django is a well-established open source project with tens of thousands of users and con-

tributors spread across the planet. Two of the original World Online developers (the “Benevolent Dictators for Life,”

Adrian and Jacob) still provide central guidance for the framework’s growth, but it’s much more of a collaborative

team effort.

This history is relevant because it helps explain two key things. The ﬁrst is Django’s “sweet spot.” Because Django

was born in a news environment, it offers several features (such as its admin site, covered in Chapter 6) that are

particularly well suited for “content” sites – sites like Amazon.com, craigslist.org, and washingtonpost.com that offer

dynamic, database-driven information. Don’t let that turn you off, though – although Django is particularly good for

developing those sorts of sites, that doesn’t preclude it from being an effective tool for building any sort of dynamic

Web site. (There’s a difference between being particularly effective at something and being ineffective at other things.)

The second matter to note is how Django’s origins have shaped the culture of its open source community. Because

Django was extracted from real-world code, rather than being an academic exercise or commercial product, it is acutely

10 Chapter 3. Chapter 1: Introduction to Django

The Django Book, Release 2.0

focused on solving Web development problems that Django’s developers themselves have faced – and continue to face.

As a result, Django itself is actively improved on an almost daily basis. The framework’s maintainers have a vested

interest in making sure Django saves developers time, produces applications that are easy to maintain and performs

well under load. If nothing else, the developers are motivated by their own selﬁsh desires to save themselves time and

enjoy their jobs. (To put it bluntly, they eat their own dog food.)

3.4 How to Read This Book

In writing this book, we tried to strike a balance between readability and reference, with a bias toward readability. Our

goal with this book, as stated earlier, is to make you a Django expert, and we believe the best way to teach is through

prose and plenty of examples, rather than providing an exhaustive but bland catalog of Django features. (As the saying

goes, you can’t expect to teach somebody how to speak a language merely by teaching them the alphabet.)

With that in mind, we recommend that you read Chapters 1 through 12 in order. They form the foundation of how

to use Django; once you’ve read them, you’ll be able to build and deploy Django-powered Web sites. Speciﬁcally,

Chapters 1 through 7 are the “core curriculum,” Chapters 8 through 11 cover more advanced Django usage, and

Chapter 12 covers deployment. The remaining chapters, 13 through 20, focus on speciﬁc Django features and can be

read in any order.

The appendixes are for reference. They, along with the free documentation at http://www.djangoproject.com/, are

probably what you’ll ﬂip back to occasionally to recall syntax or ﬁnd quick synopses of what certain parts of Django

do.

3.4.1 Required Programming Knowledge

Readers of this book should understand the basics of procedural and object-oriented programming: control structures

(e.g., if,while,for), data structures (lists, hashes/dictionaries), variables, classes and objects.

Experience in Web development is, as you may expect, very helpful, but it’s not required to understand this book.

Throughout the book, we try to promote best practices in Web development for readers who lack this experience.

3.4.2 Required Python Knowledge

At its core, Django is simply a collection of libraries written in the Python programming language. To develop a site

using Django, you write Python code that uses these libraries. Learning Django, then, is a matter of learning how to

program in Python and understanding how the Django libraries work.

If you have experience programming in Python, you should have no trouble diving in. By and large, the Django

code doesn’t perform a lot of “magic” (i.e., programming trickery whose implementation is difﬁcult to explain or

understand). For you, learning Django will be a matter of learning Django’s conventions and APIs.

If you don’t have experience programming in Python, you’re in for a treat. It’s easy to learn and a joy to use! Although

this book doesn’t include a full Python tutorial, it highlights Python features and functionality where appropriate,

particularly when code doesn’t immediately make sense. Still, we recommend you read the ofﬁcial Python tutorial,

available online at http://docs.python.org/tut/. We also recommend Mark Pilgrim’s free book Dive Into Python, avail-

able at http://www.diveintopython.net/ and published in print by Apress.

3.4.3 Required Django Version

This book covers Django 1.4.

3.4. How to Read This Book 11

The Django Book, Release 2.0

Django’s developers maintain backwards compatibility as much as possible, but occasionally introduce some back-

wards incompatible changes. The changes in each release are always covered in the release notes, which you can ﬁnd

here: https://docs.djangoproject.com/en/dev/releases/1.X

3.4.4 Getting Help

One of the greatest beneﬁts of Django is its kind and helpful user community. For help with any aspect of Django –

from installation, to application design, to database design, to deployment – feel free to ask questions online.

• The django-users mailing list is where thousands of Django users hang out to ask and answer questions. Sign

up for free at http://www.djangoproject.com/r/django-users.

• The Django IRC channel is where Django users hang out to chat and help each other in real time. Join the fun

by logging on to #django on the Freenode IRC network.

3.4.5 What’s Next

In Chapter 2, we’ll get started with Django, covering installation and initial setup.

12 Chapter 3. Chapter 1: Introduction to Django

CHAPTER 4

Chapter 2: Getting Started

Installing Django is a multi-step process, due to the multiple moving parts in modern Web development environments.

In this chapter, we’ll walk you through how to install the framework and its few dependencies.

Because Django is “just” Python code, it runs anywhere Python does – including on some cell phones! But this chapter

just covers the common scenarios for Django installations. We’ll assume you’re installing it either on a desktop/laptop

machine or a server.

Later, in Chapter 12, we’ll cover how to deploy Django to a production site.

4.1 Installing Python

Django itself is written purely in Python, so the ﬁrst step in installing the framework is to make sure you have Python

installed.

4.1.1 Python Versions

The core Django framework (version 1.4+) works with any Python version from 2.5 to 2.7, inclusive. Django’s optional

GIS (Geographic Information Systems) support requires Python 2.5 to 2.7.

If you’re not sure which version of Python to install and you have complete freedom over the decision, pick the latest

one in the 2.x series: version 2.7. Although Django works equally well with any version from 2.5 to 2.7, the later

versions of Python have performance improvements and additional language features you might like to use in your

applications. Plus, certain third-party Django add-ons that you might want to use might require a version newer than

Python 2.5, so using a later version of Python keeps your options open.

Django and Python 3.x

At the time of writing, Python 3.3 has been released, but Django only supports it experimentally. This is because the

Python 3.x series introduces a substantial number of backwards-incompatible changes to the language itself, and, as a

result, many major Python libraries and frameworks, including Django (as of version 1.4), have not yet caught up.

Django 1.5 is slated to support Python 2.6, 2.7, and 3.2. However, support for Python 3.2 is considered a “preview”,

which means the Django developers are not yet conﬁdent enough to promise stability in production. For that, they

suggest you wait until Django 1.6.

The Django Book, Release 2.0

4.1.2 Installation

If you’re on Linux or Mac OS X, you probably have Python already installed. Type python at a command prompt

(or in Applications/Utilities/Terminal, in OS X). If you see something like this, then Python is installed:

Python 2.7.3rc2 (default, Apr 22 2012, 22:30:17)

[GCC 4.6.3] on linux2

Type "help", "copyright", "credits" or "license" for more information.

>>>

Otherwise, you’ll need to download and install Python. It’s fast and easy, and detailed instructions are available at

http://www.python.org/download/

4.2 Installing Django

At any given time, two distinct versions of Django are available to you: the latest ofﬁcial release and the bleeding-edge

development version. The version you decide to install depends on your priorities. Do you want a stable and tested

version of Django, or do you want a version containing the latest features, perhaps so you can contribute to Django

itself, at the expense of stability?

We’d recommend sticking with an ofﬁcial release, but it’s important to know that the development version exists,

because you’ll ﬁnd it mentioned in the documentation and by members of the community.

4.2.1 Installing an Ofﬁcial Release

Ofﬁcial releases have a version number, such as 1.4.2, 1.4.1 or 1.4, and the latest one is always available at

http://www.djangoproject.com/download/.

If you’re on a Linux distribution that includes a package of Django, it’s a good idea to use the distributor’s version.

That way, you’ll get security updates along with the rest of your system packages.

If you don’t have access to a prepackaged version, you can download and install the framework manually. To do so,

ﬁrst download the tarball, which will be named something like Django-1.4.2.tar.gz. (It doesn’t matter which

local directory you download this ﬁle into; the installation process will put Django’s ﬁles in the right place.) Then,

unzip it and run setup.py install, as you do with most Python libraries.

Here’s how that process looks on Unix systems:

1. tar xzvf Django-1.4.2.tar.gz

2. cd Django-*

3. sudo python setup.py install

On Windows, we recommend using 7-Zip (http://www.djangoproject.com/r/7zip/) to unzip .tar.gz ﬁles. Once

you’ve unzipped the ﬁle, start up a DOS shell (the “Command Prompt”) with administrator privileges and run the

following command from within the directory whose name starts with Django-:

python setup.py install

In case you’re curious: Django’s ﬁles will be installed into your Python installation’s site-packages

directory – a directory where Python looks for third-party libraries. Usually it’s in a place like

/usr/lib/python2.7/site-packages.

14 Chapter 4. Chapter 2: Getting Started

The Django Book, Release 2.0

4.2.2 Installing the “Development” Version

Django uses Git (http://git-scm.com) for its source control. The latest and greatest Django development version avail-

able from Django’s ofﬁcial Git repository (https://github.com/django/django). You should consider installing this

version if you want to work on the bleeding edge, or if you want to contribute code to Django itself.

Git is a free, open source distributed revision-control system, and the Django team uses it to manage changes to the

Django codebase. You can download and install Git from http://git-scm.com/download but it is easier to install with

your operating system’s package manager. You can use Git to grab the very latest Django source code and, at any

given time, you can update your local version of the Django code to get the latest changes and improvements made by

Django developers.

When using the development version, keep in mind there’s no guarantee things won’t be broken at any given moment.

With that said, though, some members of the Django team run production sites on the development version, so they

have an incentive to keep it stable.

To grab the latest Django, follow these steps:

1. Make sure you have Git installed. You can get the software free from http://git-scm.com/, and you can ﬁnd

excellent documentation at http://git-scm.com/documentation.

2. Clone the repository using the command git clone https://github.com/django/django

djmaster

3. Locate your Python installation’s site-packages directory. Usually it’s in a place like

/usr/lib/python2.7/site-packages. If you have no idea, type this command from a command

prompt:

python -c ’import sys, pprint; pprint.pprint(sys.path)’

The resulting output should include your site-packages directory.

4. Within the site-packages directory, create a ﬁle called djmaster.pth and edit it to contain the full path

to your djmaster directory to it. For example, the ﬁle could just contain this line:

/path/to/djmaster

5. Place djmaster/django/bin on your system PATH. This directory includes management utilities such as

django-admin.py.

Tip:

If .pth ﬁles are new to you, you can learn more about them at http://www.djangoproject.com/r/python/site-module/.

After downloading from Git and following the preceding steps, there’s no need to run python setup.py

install– you’ve just done the work by hand!

Because the Django code changes often with bug ﬁxes and feature additions, you’ll probably want to update it ev-

ery once in a while. To update the code, just run the command git pull origin master from within the

djmaster directory. When you run that command, Git will contact https://github.com/django/django, determine

whether any of Django’s code has changed, and update your local version of the code with any changes that have been

made since you last updated. It’s quite slick.

Finally, if you use Django development version, you should know how to ﬁgure out which version of Django you’re

running. Knowing your version number is important if you ever need to reach out to the community for help, or if you

submit improvements to the framework. In these cases, you should tell people the revision, also known as a “commit,”

that you’re using. To ﬁnd out your current commit, type “git log -1” from within the django directory, and look for

the identiﬁer after “commit”. This number changes each time Django is changed, whether through a bug ﬁx, feature

addition, documentation improvement or anything else.

4.2. Installing Django 15

The Django Book, Release 2.0

4.3 Testing the Django installation

For some post-installation positive feedback, take a moment to test whether the installation worked. In a command

shell, change into another directory (e.g., not the directory that contains the django directory) and start the Python

interactive interpreter by typing python. If the installation was successful, you should be able to import the module

django:

>>> import django

>>> django.VERSION

(1, 4, 2, ’final’, 0)

Interactive Interpreter Examples

The Python interactive interpreter is a command-line program that lets you write a Python program interactively. To

start it, run the command python at the command line.

Throughout this book, we feature example Python interactive interpreter sessions. You can recognize these examples

by the triple greater-than signs (>>>), which designate the interpreter’s prompt. If you’re copying examples from this

book, don’t copy those greater-than signs.

Multiline statements in the interactive interpreter are padded with three dots (...). For example:

>>> print """This is a

... string that spans

... three lines."""

This is a

string that spans

three lines.

>>> def my_function(value):

... print value

>>> my_function(’hello’)

hello

Those three dots at the start of the additional lines are inserted by the Python shell – they’re not part of our input. We

include them here to be faithful to the actual output of the interpreter. If you copy our examples to follow along, don’t

copy those dots.

4.4 Setting Up a Database

At this point, you could very well begin writing a Web application with Django, because Django’s only hard-and-fast

prerequisite is a working Python installation. However, odds are you’ll be developing a database-driven Web site, in

which case you’ll need to conﬁgure a database server.

If you just want to start playing with Django, skip ahead to the “Starting a Project” section – but keep in mind that all

the examples in this book assume you have a working database set up.

Django supports four database engines:

• PostgreSQL (http://www.postgresql.org/)

• SQLite 3 (http://www.sqlite.org/)

• MySQL (http://www.mysql.com/)

• Oracle (http://www.oracle.com/)

For the most part, all the engines here work equally well with the core Django framework. (A notable exception is

Django’s optional GIS support, which is much more powerful with PostgreSQL than with other databases.) If you’re

16 Chapter 4. Chapter 2: Getting Started

The Django Book, Release 2.0

not tied to any legacy system and have the freedom to choose a database backend, we recommend PostgreSQL, which

achieves a ﬁne balance between cost, features, speed and stability.

Setting up the database is a two-step process:

• First, you’ll need to install and conﬁgure the database server itself. This process is beyond the scope of this

book, but each of the four database backends has rich documentation on its Web site. (If you’re on a shared

hosting provider, odds are that they’ve set this up for you already.)

• Second, you’ll need to install the Python library for your particular database backend. This is a third-party bit

of code that allows Python to interface with the database. We outline the speciﬁc, per-database requirements in

the following sections.

If you’re just playing around with Django and don’t want to install a database server, consider using SQLite. SQLite is

unique in the list of supported databases in that it doesn’t require either of the above steps. It merely reads and writes

its data to a single ﬁle on your ﬁlesystem, and Python versions 2.5 and higher include built-in support for it.

On Windows, obtaining database driver binaries can be frustrating. If you’re eager to jump in, we recommend using

Python 2.7 and its built-in support for SQLite.

4.4.1 Using Django with PostgreSQL

If you’re using PostgreSQL, you’ll need to install either the psycopg or psycopg2 package from

http://www.djangoproject.com/r/python-pgsql/. We recommend psycopg2, as it’s newer, more actively developed

and can be easier to install. Either way, take note of whether you’re using version 1 or 2; you’ll need this information

later.

If you’re using PostgreSQL on Windows, you can ﬁnd precompiled binaries of psycopg at

http://www.djangoproject.com/r/python-pgsql/windows/.

If you’re on Linux, check whether your distribution’s package-management system offers a package called “python-

psycopg2”, “psycopg2-python”, “python-postgresql” or something similar.

4.4.2 Using Django with SQLite 3

You’re in luck: no database-speciﬁc installation is required, because Python ships with SQLite support. Skip ahead to

the next section.

4.4.3 Using Django with MySQL

Django requires MySQL 4.0 or above. The 3.x versions don’t support nested subqueries and some other fairly standard

SQL statements.

You’ll also need to install the MySQLdb package from http://www.djangoproject.com/r/python-mysql/.

If you’re on Linux, check whether your distribution’s package-management system offers a package called “python-

mysql”, “python-mysqldb”, “mysql-python” or something similar.

4.4.4 Using Django with Oracle

Django works with Oracle Database Server versions 9i and higher.

If you’re using Oracle, you’ll need to install the cx_Oracle library, available at http://cx-oracle.sourceforge.net/.

Use version 4.3.1 or higher, but avoid version 5.0 due to a bug in that version of the driver. Version 5.0.1 resolved the

bug, however, so you can choose a higher version as well.

4.4. Setting Up a Database 17

The Django Book, Release 2.0

4.4.5 Using Django Without a Database

As mentioned earlier, Django doesn’t actually require a database. If you just want to use it to serve dynamic pages

that don’t hit a database, that’s perfectly ﬁne.

With that said, bear in mind that some of the extra tools bundled with Django do require a database, so if you choose

not to use a database, you’ll miss out on those features. (We highlight these features throughout this book.)

4.5 Starting a Project

Once you’ve installed Python, Django and (optionally) your database server/library, you can take the ﬁrst step in

developing a Django application by creating a project.

A project is a collection of settings for an instance of Django, including database conﬁguration, Django-speciﬁc

options and application-speciﬁc settings.

If this is your ﬁrst time using Django, you’ll have to take care of some initial setup. Create a new directory to start

working in, perhaps something like /home/username/djcode/.

Where Should This Directory Live?

If your background is in PHP, you’re probably used to putting code under the Web server’s document root (in a place

such as /var/www). With Django, you don’t do that. It’s not a good idea to put any of this Python code within your

Web server’s document root, because in doing so you risk the possibility that people will be able to view your raw

source code over the Web. That’s not good.

Put your code in some directory outside of the document root.

Change into the directory you created, and run the command django-admin.py startproject mysite.

This will create a mysite directory in your current directory.

Note: django-admin.py should be on your system path if you installed Django via its setup.py utility.

If you’re using the development version, you’ll ﬁnd django-admin.py in djmaster/django/bin.

Because you’ll be using django-admin.py often, consider adding it to your system path. On

Unix, you can do so by symlinking from /usr/local/bin, using a command such as sudo ln -s

/path/to/django/bin/django-admin.py /usr/local/bin/django-admin.py. On Windows,

you’ll need to update your PATH environment variable.

If you installed Django from a packaged version for your Linux distribution, django-admin.py might be called

django-admin instead.

If you see a “permission denied” message when running django-admin.py startproject, you’ll need to

change the ﬁle’s permissions. To do this, navigate to the directory where django-admin.py is installed (e.g., cd

/usr/local/bin) and run the command chmod +x django-admin.py.

The startproject command creates a directory containing ﬁve ﬁles:

mysite/

manage.py

mysite/

__init__.py

settings.py

urls.py

wsgi.py

18 Chapter 4. Chapter 2: Getting Started

The Django Book, Release 2.0

Note: Doesn’t match what you see?

The default project layout recently changed. If you’re seeing a “ﬂat” layout (with no inner mysite/ directory), you’re

probably using a version of Django that doesn’t match this tutorial version. This book covers Django 1.4 and above,

so if you’re using an older version you probably want to consult Django’s ofﬁcial documentation.

The documentation for Django 1.X version is available at https://docs.djangoproject.com/en/1.X/.

These ﬁles are as follows:

•mysite/: The outer mysite/ directory is just a container for your project. Its name doesn’t matter to Django;

you can rename it to anything you like.

•manage.py: A command-line utility that lets you interact with this Django project in various ways. Type

python manage.py help to get a feel for what it can do. You should never have to edit this ﬁle; it’s

created in this directory purely for convenience.

•mysite/mysite/: The inner mysite/ directory is the actual Python package for your project. Its name is

the Python package name you’ll need to use to import anything inside it (e.g. import mysite.settings).

•__init__.py: A ﬁle required for Python to treat the mysite directory as a package (i.e., a group of Python

modules). It’s an empty ﬁle, and generally you won’t add anything to it.

•settings.py: Settings/conﬁguration for this Django project. Take a look at it to get an idea of the types of

settings available, along with their default values.

•urls.py: The URLs for this Django project. Think of this as the “table of contents” of your Django-powered

site.

•wsgi.py: An entry-point for WSGI-compatible webservers to serve your project. See How to deploy with

WSGI (https://docs.djangoproject.com/en/1.4/howto/deployment/wsgi/) for more details.

Despite their small size, these ﬁles already constitute a working Django application.

4.5.1 Running the Development Server

For some more post-installation positive feedback, let’s run the Django development server to see our barebones

application in action.

The Django development server (also called the “runserver” after the command that launches it) is a built-in,

lightweight Web server you can use while developing your site. It’s included with Django so you can develop your

site rapidly, without having to deal with conﬁguring your production server (e.g., Apache) until you’re ready for pro-

duction. The development server watches your code and automatically reloads it, making it easy for you to change

your code without needing to restart anything.

To start the server, change into your project container directory (cd mysite), if you haven’t already, and run this

command:

python manage.py runserver

You’ll see something like this:

Validating models...

0 errors found.

Django version 1.4.2, using settings ’mysite.settings’

Development server is running at http://127.0.0.1:8000/

Quit the server with CONTROL-C.

4.5. Starting a Project 19

The Django Book, Release 2.0

This launches the server locally, on port 8000, accessible only to connections from your own computer. Now that it’s

running, visit http://127.0.0.1:8000/ with your Web browser. You might see a different Django version depending on

which version of Django you have installed. You’ll see a “Welcome to Django” page shaded in a pleasant pastel blue.

It worked!

One ﬁnal, important note about the development server is worth mentioning before proceeding. Although this server

is convenient for development, resist the temptation to use it in anything resembling a production environment. The

development server can handle only a single request at a time reliably, and it has not gone through a security audit of

any sort. When the time comes to launch your site, see Chapter 12 for information on how to deploy Django.

Changing the Development Server’s Host or Port

By default, the runserver command starts the development server on port 8000, listening only for local connections.

If you want to change the server’s port, pass it as a command-line argument:

python manage.py runserver 8080

By specifying an IP address, you can tell the server to allow non-local connections. This is especially helpful if you’d

like to share a development site with other members of your team. The IP address 0.0.0.0 tells the server to listen

on any network interface:

python manage.py runserver 0.0.0.0:8000

When you’ve done this, other computers on your local network will be able to view your Django site by visiting your

IP address in their Web browsers, e.g., http://192.168.1.103:8000/ . (Note that you’ll have to consult your network

settings to determine your IP address on the local network. Unix users, try running “ifconﬁg” in a command prompt

to get this information. Windows users, try “ipconﬁg”.)

4.6 What’s Next?

Now that you have everything installed and the development server running, you’re ready to :doc: learn the basics

Chapter 3, of serving Web pages with Django.

20 Chapter 4. Chapter 2: Getting Started

CHAPTER 5

Chapter 3: Views and URLconfs

In the previous chapter, we explained how to set up a Django project and run the Django development server. In this

chapter, you’ll learn the basics of creating dynamic Web pages with Django.

5.1 Your First Django-Powered Page: Hello World

As our ﬁrst goal, let’s create a Web page that outputs that famous example message: “Hello world.”

If you were publishing a simple “Hello world” Web page without a Web framework, you’d simply type

“Hello world” into a text ﬁle, call it hello.html, and upload it to a directory on a Web server some-

where. Notice, in that process, you’ve speciﬁed two key pieces of information about that Web page: its con-

tents (the string "Hello world") and its URL ( http://www.example.com/hello.html, or maybe

http://www.example.com/files/hello.html if you put it in a subdirectory).

With Django, you specify those same two things, but in a different way. The contents of the page are produced by a

view function, and the URL is speciﬁed in a URLconf. First, let’s write our “Hello world” view function.

5.1.1 Your First View

Within the mysite directory that django-admin.py startproject made in the last chapter, create an empty

ﬁle called views.py. This Python module will contain our views for this chapter. Note that there’s nothing special

about the name views.py – Django doesn’t care what the ﬁle is called, as you’ll see in a bit – but it’s a good idea to

call it views.py as a convention, for the beneﬁt of other developers reading your code.

Our “Hello world” view is simple. Here’s the entire function, plus import statements, which you should type into the

views.py ﬁle:

from django.http import HttpResponse

def hello(request):

return HttpResponse("Hello world")

Let’s step through this code one line at a time:

• First, we import the class HttpResponse, which lives in the django.http module. We need to import this

class because it’s used later in our code.

• Next, we deﬁne a function called hello – the view function.

Each view function takes at least one parameter, called request by convention. This is an object that con-

tains information about the current Web request that has triggered this view, and it’s an instance of the class

The Django Book, Release 2.0

django.http.HttpRequest. In this example, we don’t do anything with request, but it must be the

ﬁrst parameter of the view nonetheless.

Note that the name of the view function doesn’t matter; it doesn’t have to be named in a certain way in order for

Django to recognize it. We’re calling it hello here, because that name clearly indicates the gist of the view, but

it could just as well be named hello_wonderful_beautiful_world, or something equally revolting.

The next section, “Your First URLconf”, will shed light on how Django ﬁnds this function.

• The function is a simple one-liner: it merely returns an HttpResponse object that has been instantiated with

the text "Hello world".

The main lesson here is this: a view is just a Python function that takes an HttpRequest as its ﬁrst parameter and

returns an instance of HttpResponse. In order for a Python function to be a Django view, it must do these two

things. (There are exceptions, but we’ll get to those later.)

5.1.2 Your First URLconf

If, at this point, you ran python manage.py runserver again, you’d still see the “Welcome to Django” mes-

sage, with no trace of our “Hello world” view anywhere. That’s because our mysite project doesn’t yet know about

the hello view; we need to tell Django explicitly that we’re activating this view at a particular URL. (Continuing our

previous analogy of publishing static HTML ﬁles, at this point we’ve created the HTML ﬁle but haven’t uploaded it to

a directory on the server yet.) To hook a view function to a particular URL with Django, use a URLconf.

AURLconf is like a table of contents for your Django-powered Web site. Basically, it’s a mapping between URLs

and the view functions that should be called for those URLs. It’s how you tell Django, “For this URL, call this code,

and for that URL, call that code.” For example, “When somebody visits the URL /foo/, call the view function

foo_view(), which lives in the Python module views.py.”

When you executed django-admin.py startproject in the previous chapter, the script created a URLconf

for you automatically: the ﬁle urls.py. By default, it looks something like this:

from django.conf.urls import patterns, include, url

# Uncomment the next two lines to enable the admin:

# from django.contrib import admin

# admin.autodiscover()

urlpatterns =patterns(’’,

# Examples:

# url(r’^$’, ’mysite.views.home’, name=’home’),

# url(r’^mysite/’, include(’mysite.foo.urls’)),

# Uncomment the admin/doc line below to enable admin documentation:

# url(r’^admin/doc/’, include(’django.contrib.admindocs.urls’)),

# Uncomment the next line to enable the admin:

# url(r’^admin/’, include(admin.site.urls)),

)

This default URLconf includes some commonly used Django features commented out, so that activating those features

is as easy as uncommenting the appropriate lines. If we ignore the commented-out code, here’s the essence of a

URLconf:

from django.conf.urls.defaults import patterns, include, url

urlpatterns =patterns(’’,

)

22 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

Let’s step through this code one line at a time:

• The ﬁrst line imports three functions from the django.conf.urls.defaults module, which is Django’s

URLconf infrastructure: patterns,include, and urls.

• The second line calls the function patterns and saves the result into a variable called urlpatterns. The

patterns function gets passed only a single argument – the empty string. (The string can be used to supply a

common preﬁx for view functions, which we’ll cover in Chapter 8: Advanced Views and URLconfs.)

The main thing to note here is the variable urlpatterns, which Django expects to ﬁnd in your URLconf module.

This variable deﬁnes the mapping between URLs and the code that handles those URLs. By default, as we can see,

the URLconf is empty – your Django application is a blank slate. (As a side note, that’s how Django knew to show

you the “Welcome to Django” page in the last chapter. If your URLconf is empty, Django assumes you just started a

new project and, hence, displays that message.)

To add a URL and view to the URLconf, just add a mapping between a URL pattern and the view function. Here’s

how to hook in our hello view:

from django.conf.urls.defaults import patterns, include, url

from mysite.views import hello

urlpatterns =patterns(’’,

url(r’^hello/$’, hello),

)

(Note that we’ve removed the commented-out code for brevity. You can choose to leave those lines in, if you’d like.)

We made two changes here:

• First, we imported the hello view from its module – mysite/views.py, which translates into

mysite.views in Python import syntax. (This assumes mysite/views.py is on your Python path; see

the sidebar for details.)

• Next, we added the line url(r’^hello/$’, hello), to urlpatterns. This line is referred to as

aURLpattern. The url() function tells Django how to handle the url that you are conﬁguring. The ﬁrst

argument is a pattern-matching string (a regular expression; more on this in a bit) and the second argument is

the view function to use for that pattern. url() can take other optional arguments as well, which we’ll cover

in more depth in Chapter 8: Advanced Views and URLconfs.

Note: One more important detail we’ve introduced here is that rcharacter in front of the regular expression string.

This tells Python that the string is a “raw string” – its contents should not interpret backslashes. In normal Python

strings, backslashes are used for escaping special characters – such as in the string ’\n’, which is a one-character

string containing a newline. When you add the rto make it a raw string, Python does not apply its backslash escaping

– so, r’\n’ is a two-character string containing a literal backslash and a lowercase “n”. There’s a natural collision

between Python’s usage of backslashes and the backslashes that are found in regular expressions, so it’s strongly

suggested that you use raw strings any time you’re deﬁning a regular expression in Python. All of the URLpatterns in

this book will be raw strings.

In a nutshell, we just told Django that any request to the URL /hello/ should be handled by the hello view

function.

Your Python Path

Your Python path is the list of directories on your system where Python looks when you use the Python import

statement.

For example, let’s say your Python path is set to [’’, ’/usr/lib/python2.7/site-packages’,

’/home/username/djcode’]. If you execute the Python statement from foo import bar, Python

will look for a module called foo.py in the current directory. (The ﬁrst entry in the Python

5.1. Your First Django-Powered Page: Hello World 23

The Django Book, Release 2.0

path, an empty string, means “the current directory.”) If that ﬁle doesn’t exist, Python will look

for the ﬁle /usr/lib/python2.7/site-packages/foo.py. If that ﬁle doesn’t exist, it will try

/home/username/djcode/foo.py. Finally, if that ﬁle doesn’t exist, it will raise ImportError.

If you’re interested in seeing the value of your Python path, start the Python interactive interpreter and type this:

>>> import sys

>>> print sys.path

Generally you don’t have to worry about setting your Python path – Python and Django take care of things for you

automatically behind the scenes. (Setting the Python path is one of the things that the manage.py script does.)

It’s worth discussing the syntax of this URLpattern, as it may not be immediately obvious. Although we want to match

the URL /hello/, the pattern looks a bit different than that. Here’s why:

• Django removes the slash from the front of every incoming URL before it checks the URLpatterns. This means

that our URLpattern doesn’t include the leading slash in /hello/. (At ﬁrst, this may seem unintuitive, but this

requirement simpliﬁes things – such as the inclusion of URLconfs within other URLconfs, which we’ll cover in

Chapter 8.)

• The pattern includes a caret (^) and a dollar sign ($). These are regular expression characters that have a special

meaning: the caret means “require that the pattern matches the start of the string,” and the dollar sign means

“require that the pattern matches the end of the string.”

This concept is best explained by example. If we had instead used the pattern ’^hello/’ (without a dollar sign

at the end), then any URL starting with /hello/ would match, such as /hello/foo and /hello/bar, not

just /hello/. Similarly, if we had left off the initial caret character (i.e., ’hello/$’), Django would match

any URL that ends with hello/, such as /foo/bar/hello/. If we had simply used hello/, without a

caret or dollar sign, then any URL containing hello/ would match, such as /foo/hello/bar. Thus, we

use both the caret and dollar sign to ensure that only the URL /hello/ matches – nothing more, nothing less.

Most of your URLpatterns will start with carets and end with dollar signs, but it’s nice to have the ﬂexibility to

perform more sophisticated matches.

You may be wondering what happens if someone requests the URL /hello (that is, without a trailing slash).

Because our URLpattern requires a trailing slash, that URL would not match. However, by default, any request

to a URL that doesn’t match a URLpattern and doesn’t end with a slash will be redirected to the same URL with

a trailing slash. (This is regulated by the APPEND_SLASH Django setting, which is covered in Appendix D.)

If you’re the type of person who likes all URLs to end with slashes (which is the preference of Django’s de-

velopers), all you’ll need to do is add a trailing slash to each URLpattern and leave APPEND_SLASH set to

True. If you prefer your URLs not to have trailing slashes, or if you want to decide it on a per-URL basis, set

APPEND_SLASH to False and put trailing slashes in your URLpatterns as you see ﬁt.

The other thing to note about this URLconf is that we’ve passed the hello view function as an object without calling

the function. This is a key feature of Python (and other dynamic languages): functions are ﬁrst-class objects, which

means you can pass them around just like any other variables. Cool stuff, eh?

To test our changes to the URLconf, start the Django development server, as you did in Chapter 2, by running the

command python manage.py runserver. (If you left it running, that’s ﬁne, too. The development server

automatically detects changes to your Python code and reloads as necessary, so you don’t have to restart the server

between changes.) The server is running at the address http://127.0.0.1:8000/, so open up a Web browser

and go to http://127.0.0.1:8000/hello/. You should see the text “Hello world” – the output of your

Django view.

Hooray! You’ve made your ﬁrst Django-powered Web page.

Regular Expressions

24 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

Regular expressions (or regexes) are a compact way of specifying patterns in text. While Django URLconfs allow

arbitrary regexes for powerful URL matching, you’ll probably only use a few regex symbols in practice. Here’s a

selection of common symbols:

Symbol Matches

.(dot) Any single character

\d Any single digit

[A-Z] Any character between Aand Z(uppercase)

[a-z] Any character between aand z(lowercase)

[A-Za-z] Any character between aand z(case-insensitive)

+One or more of the previous expression (e.g., \d+ matches one or more digits)

[^/]+ One or more characters until (and not including) a forward slash

?Zero or one of the previous expression (e.g., \d? matches zero or one digits)

*Zero or more of the previous expression (e.g., \d*matches zero, one or more than one digit)

{1,3} Between one and three (inclusive) of the previous expression (e.g., \d{1,3} matches one, two or

three digits)

For more on regular expressions, see http://www.djangoproject.com/r/python/re-module/.

5.1.3 A Quick Note About 404 Errors

At this point, our URLconf deﬁnes only a single URLpattern: the one that handles requests to the URL /hello/.

What happens when you request a different URL?

To ﬁnd out, try running the Django development server and visiting a page such as

http://127.0.0.1:8000/goodbye/ or http://127.0.0.1:8000/hello/subdirectory/,

or even http://127.0.0.1:8000/ (the site “root”). You should see a “Page not found” message (see Figure

3-1). Django displays this message because you requested a URL that’s not deﬁned in your URLconf.

The utility of this page goes beyond the basic 404 error message. It also tells you precisely which URLconf Django

used and every pattern in that URLconf. From that information, you should be able to tell why the requested URL

threw a 404.

Naturally, this is sensitive information intended only for you, the Web developer. If this were a production site deployed

live on the Internet, you wouldn’t want to expose that information to the public. For that reason, this “Page not found”

page is only displayed if your Django project is in debug mode. We’ll explain how to deactivate debug mode later. For

now, just know that every Django project is in debug mode when you ﬁrst create it, and if the project is not in debug

mode, Django outputs a different 404 response.

5.1.4 A Quick Note About The Site Root

As explained in the last section, you’ll see a 404 error message if you view the site root –

http://127.0.0.1:8000/. Django doesn’t add magically anything to the site root; that URL is not special-

cased in any way. It’s up to you to assign it to a URLpattern, just like every other entry in your URLconf.

The URLpattern to match the site root is a bit unintuitive, though, so it’s worth mentioning. When you’re ready to

implement a view for the site root, use the URLpattern ’^$’, which matches an empty string. For example:

from mysite.views import hello, my_homepage_view

urlpatterns =patterns(’’,

url(r’^$’, my_homepage_view),

# ...

)

5.1. Your First Django-Powered Page: Hello World 25

The Django Book, Release 2.0

Figure 5.1: Figure 3-1. Django’s 404 page

26 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

5.2 How Django Processes a Request

Before continuing to our second view function, let’s pause to learn a little more about how Django works. Speciﬁ-

cally, when you view your “Hello world” message by visiting http://127.0.0.1:8000/hello/ in your Web

browser, what does Django do behind the scenes?

It all starts with the settings ﬁle. When you run python manage.py runserver, the script looks for a ﬁle

called settings.py in the inner mysite directory. This ﬁle contains all sorts of conﬁguration for this partic-

ular Django project, all in uppercase: TEMPLATE_DIRS,DATABASES, etc. The most important setting is called

ROOT_URLCONF.ROOT_URLCONF tells Django which Python module should be used as the URLconf for this Web

site.

Remember when django-admin.py startproject created the ﬁles settings.py and urls.py? The

autogenerated settings.py contains a ROOT_URLCONF setting that points to the autogenerated urls.py. Open

the settings.py ﬁle and see for yourself; it should look like this:

ROOT_URLCONF =’mysite.urls’

This corresponds to the ﬁle mysite/urls.py.

When a request comes in for a particular URL – say, a request for /hello/ – Django loads the URLconf pointed to

by the ROOT_URLCONF setting. Then it checks each of the URLpatterns in that URLconf, in order, comparing the

requested URL with the patterns one at a time, until it ﬁnds one that matches. When it ﬁnds one that matches, it calls

the view function associated with that pattern, passing it an HttpRequest object as the ﬁrst parameter. (We’ll cover

the speciﬁcs of HttpRequest later.)

As we saw in our ﬁrst view example, a view function must return an HttpResponse. Once it does this, Django does

the rest, converting the Python object to a proper Web response with the appropriate HTTP headers and body (i.e., the

content of the Web page).

In summary:

1. A request comes in to /hello/.

2. Django determines the root URLconf by looking at the ROOT_URLCONF setting.

3. Django looks at all of the URLpatterns in the URLconf for the ﬁrst one that matches /hello/.

4. If it ﬁnds a match, it calls the associated view function.

5. The view function returns an HttpResponse.

6. Django converts the HttpResponse to the proper HTTP response, which results in a Web page.

You now know the basics of how to make Django-powered pages. It’s quite simple, really – just write view functions

and map them to URLs via URLconfs.

5.3 Your Second View: Dynamic Content

Our “Hello world” view was instructive in demonstrating the basics of how Django works, but it wasn’t an example

of a dynamic Web page, because the content of the page are always the same. Every time you view /hello/, you’ll

see the same thing; it might as well be a static HTML ﬁle.

For our second view, let’s create something more dynamic – a Web page that displays the current date and time. This

is a nice, simple next step, because it doesn’t involve a database or any user input – just the output of your server’s

internal clock. It’s only marginally more exciting than “Hello world,” but it’ll demonstrate a few new concepts.

5.2. How Django Processes a Request 27

The Django Book, Release 2.0

This view needs to do two things: calculate the current date and time, and return an HttpResponse containing that

value. If you have experience with Python, you know that Python includes a datetime module for calculating dates.

Here’s how to use it:

>>> import datetime

>>> now =datetime.datetime.now()

>>> now

datetime.datetime(2008, 12, 13, 14, 9, 39, 2731)

>>> print now

2008-12-13 14:09:39.002731

That’s simple enough, and it has nothing to do with Django. It’s just Python code. (We want to emphasize that you

should be aware of what code is “just Python” vs. code that is Django-speciﬁc. As you learn Django, we want you to

be able to apply your knowledge to other Python projects that don’t necessarily use Django.)

To make a Django view that displays the current date and time, then, we just need to hook this

datetime.datetime.now() statement into a view and return an HttpResponse. Here’s how that looks:

from django.http import HttpResponse

import datetime

def current_datetime(request):

now =datetime.datetime.now()

html ="<html><body>It is now %s.</body></html>" %now

return HttpResponse(html)

As with our hello view function, this should live in views.py. Note that we’ve hidden the hello function from

this example for brevity, but for the sake of completeness, here’s what the entire views.py looks like:

from django.http import HttpResponse

import datetime

def hello(request):

return HttpResponse("Hello world")

def current_datetime(request):

now =datetime.datetime.now()

html ="<html><body>It is now %s.</body></html>" %now

return HttpResponse(html)

(From now on, we won’t display previous code in code examples, except when necessary. You should be able to tell

from context which parts of an example are new vs. old.)

Let’s step through the changes we’ve made to views.py to accommodate the current_datetime view.

• We’ve added an import datetime to the top of the module, so we can calculate dates.

• The new current_datetime function calculates the current date and time, as a datetime.datetime

object, and stores that as the local variable now.

• The second line of code within the view constructs an HTML response using Python’s “format-string” capability.

The %s within the string is a placeholder, and the percent sign after the string means “Replace the %s in the

preceding string with the value of the variable now.” The now variable is technically a datetime.datetime

object, not a string, but the %s format character converts it to its string representation, which is something like

"2008-12-13 14:09:39.002731". This will result in an HTML string such as "<html><body>It

is now 2008-12-13 14:09:39.002731.</body></html>".

(Yes, our HTML is invalid, but we’re trying to keep the example simple and short.)

• Finally, the view returns an HttpResponse object that contains the generated response – just as we did in

hello.

28 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

After adding that to views.py, add the URLpattern to urls.py to tell Django which URL should handle this view.

Something like /time/ would make sense:

from django.conf.urls.defaults import patterns, include, url

from mysite.views import hello, current_datetime

urlpatterns =patterns(’’,

url(r’^hello/$’, hello),

url(r’^time/$’, current_datetime),

)

We’ve made two changes here. First, we imported the current_datetime function at the top. Second, and more

importantly, we added a URLpattern mapping the URL /time/ to that new view. Getting the hang of this?

With the view written and URLconf updated, ﬁre up the runserver and visit

http://127.0.0.1:8000/time/ in your browser. You should see the current date and time.

Django’s Time Zone

Depending on your computer, the date and time may be a few hours off. That’s because Django is time zone-aware

and defaults to the America/Chicago time zone. (It has to default to something, and that’s the time zone where

the original developers live.) If you live elsewhere, you’ll want to change it in settings.py. See the comment in

that ﬁle for a link to an up-to-date list of worldwide time zone options.

5.4 URLconfs and Loose Coupling

Now’s a good time to highlight a key philosophy behind URLconfs and behind Django in general: the principle of

loose coupling. Simply put, loose coupling is a software-development approach that values the importance of making

pieces interchangeable. If two pieces of code are loosely coupled, then changes made to one of the pieces will have

little or no effect on the other.

Django’s URLconfs are a good example of this principle in practice. In a Django web application, the URL deﬁnitions

and the view functions they call are loosely coupled; that is, the decision of what the URL should be for a given

function, and the implementation of the function itself, reside in two separate places. This lets you switch out one

piece without affecting the other.

For example, consider our current_datetime view. If we wanted to change the URL for the application – say,

to move it from /time/ to /current-time/ – we could make a quick change to the URLconf, without having

to worry about the view itself. Similarly, if we wanted to change the view function – altering its logic somehow – we

could do that without affecting the URL to which the function is bound.

Furthermore, if we wanted to expose the current-date functionality at several URLs, we could easily take care of

that by editing the URLconf, without having to touch the view code. In this example, our current_datetime is

available at two URLs. It’s a contrived example, but this technique can come in handy:

urlpatterns =patterns(’’,

url(r’^hello/$’, hello),

url(r’^time/$’, current_datetime),

url(r’^another-time-page/$’, current_datetime),

)

URLconfs and views are loose coupling in action. We’ll continue to point out examples of this important philosophy

throughout this book.

5.4. URLconfs and Loose Coupling 29

The Django Book, Release 2.0

5.5 Your Third View: Dynamic URLs

In our current_datetime view, the contents of the page – the current date/time – were dynamic, but the URL

(/time/) was static. In most dynamic Web applications, though, a URL contains parameters that inﬂuence the

output of the page. For example, an online bookstore might give each book its own URL, like /books/243/ and

/books/81196/.

Let’s create a third view that displays the current date and time offset by a certain number of hours. The goal is

to craft a site in such a way that the page /time/plus/1/ displays the date/time one hour into the future, the

page /time/plus/2/ displays the date/time two hours into the future, the page /time/plus/3/ displays the

date/time three hours into the future, and so on.

A novice might think to code a separate view function for each hour offset, which might result in a URLconf like this:

urlpatterns =patterns(’’,

url(r’^time/$’, current_datetime),

url(r’^time/plus/1/$’, one_hour_ahead),

url(r’^time/plus/2/$’, two_hours_ahead),

url(r’^time/plus/3/$’, three_hours_ahead),

url(r’^time/plus/4/$’, four_hours_ahead),

)

Clearly, this line of thought is ﬂawed. Not only would this result in redundant view functions, but also the application

is fundamentally limited to supporting only the predeﬁned hour ranges – one, two, three or four hours. If we decided

to create a page that displayed the time ﬁve hours into the future, we’d have to create a separate view and URLconf

line for that, furthering the duplication. We need to do some abstraction here.

A Word About Pretty URLs

If you’re experienced in another Web development platform, such as PHP or Java, you may be thinking, “Hey, let’s

use a query string parameter!” – something like /time/plus?hours=3, in which the hours would be designated

by the hours parameter in the URL’s query string (the part after the ?).

You can do that with Django (and we’ll tell you how in Chapter 7), but one of Django’s core philosophies is that URLs

should be beautiful. The URL /time/plus/3/ is far cleaner, simpler, more readable, easier to recite to somebody

aloud and . . . just plain prettier than its query string counterpart. Pretty URLs are a characteristic of a quality Web

application.

Django’s URLconf system encourages pretty URLs by making it easier to use pretty URLs than not to.

How, then do we design our application to handle arbitrary hour offsets? The key is to use wildcard URLpatterns. As

we mentioned previously, a URLpattern is a regular expression; hence, we can use the regular expression pattern \d+

to match one or more digits:

urlpatterns =patterns(’’,

# ...

url(r’^time/plus/\d+/$’, hours_ahead),

# ...

)

(We’re using the # ... to imply there might be other URLpatterns that we trimmed from this example.)

This new URLpattern will match any URL such as /time/plus/2/,/time/plus/25/, or even

/time/plus/100000000000/. Come to think of it, let’s limit it so that the maximum allowed offset is 99 hours.

That means we want to allow either one- or two-digit numbers – and in regular expression syntax, that translates into

\d{1,2}:

30 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

url(r’^time/plus/\d{1,2}/$’, hours_ahead),

Note: When building Web applications, it’s always important to consider the most outlandish data input possible, and

decide whether or not the application should support that input. We’ve curtailed the outlandishness here by limiting

the offset to 99 hours.

Now that we’ve designated a wildcard for the URL, we need a way of passing that wildcard data to the view function,

so that we can use a single view function for any arbitrary hour offset. We do this by placing parentheses around the

data in the URLpattern that we want to save. In the case of our example, we want to save whatever number was entered

in the URL, so let’s put parentheses around the \d{1,2}, like this:

url(r’^time/plus/(\d{1,2})/$’, hours_ahead),

If you’re familiar with regular expressions, you’ll be right at home here; we’re using parentheses to capture data from

the matched text.

The ﬁnal URLconf, including our previous two views, looks like this:

from django.conf.urls.defaults import *

from mysite.views import hello, current_datetime, hours_ahead

urlpatterns =patterns(’’,

url(r’^hello/$’, hello),

url(r’^time/$’, current_datetime),

url(r’^time/plus/(\d{1,2})/$’, hours_ahead),

)

With that taken care of, let’s write the hours_ahead view.

hours_ahead is very similar to the current_datetime view we wrote earlier, with a key difference: it takes

an extra argument, the number of hours of offset. Here’s the view code:

from django.http import Http404, HttpResponse

import datetime

def hours_ahead(request, offset):

try:

offset =int(offset)

except ValueError:

raise Http404()

dt =datetime.datetime.now() +datetime.timedelta(hours=offset)

html ="<html><body>In %s hour(s), it will be %s.</body></html>" %(offset, dt)

return HttpResponse(html)

Let’s step through this code one line at a time:

• The view function, hours_ahead, takes two parameters: request and offset.

–request is an HttpRequest object, just as in hello and current_datetime. We’ll say it again:

each view always takes an HttpRequest object as its ﬁrst parameter.

–offset is the string captured by the parentheses in the URLpattern. For example, if the requested

URL were /time/plus/3/, then offset would be the string ’3’. If the requested URL were

/time/plus/21/, then offset would be the string ’21’. Note that captured values will always

be strings, not integers, even if the string is composed of only digits, such as ’21’.

(Technically, captured values will always be Unicode objects, not plain Python bytestrings, but don’t worry

about this distinction at the moment.)

5.5. Your Third View: Dynamic URLs 31

The Django Book, Release 2.0

We decided to call the variable offset, but you can call it whatever you’d like, as long as it’s a valid

Python identiﬁer. The variable name doesn’t matter; all that matters is that it’s the second argument to

the function, after request. (It’s also possible to use keyword, rather than positional, arguments in an

URLconf. We cover that in Chapter 8.)

• The ﬁrst thing we do within the function is call int() on offset. This converts the string value to an integer.

Note that Python will raise a ValueError exception if you call int() on a value that cannot be converted to

an integer, such as the string ’foo’. In this example, if we encounter the ValueError, we raise the exception

django.http.Http404, which, as you can imagine, results in a 404 “Page not found” error.

Astute readers will wonder: how could we ever reach the ValueError case, anyway, given that the regular

expression in our URLpattern – (\d{1,2}) – captures only digits, and therefore offset will only ever be a

string composed of digits? The answer is, we won’t, because the URLpattern provides a modest but useful level

of input validation, but we still check for the ValueError in case this view function ever gets called in some

other way. It’s good practice to implement view functions such that they don’t make any assumptions about

their parameters. Loose coupling, remember?

• In the next line of the function, we calculate the current date/time and add the appropriate number of hours.

We’ve already seen datetime.datetime.now() from the current_datetime view; the new concept

here is that you can perform date/time arithmetic by creating a datetime.timedelta object and adding to

adatetime.datetime object. Our result is stored in the variable dt.

This line also shows why we called int() on offset – the datetime.timedelta function requires the

hours parameter to be an integer.

• Next, we construct the HTML output of this view function, just as we did in current_datetime. A small

difference in this line from the previous line is that it uses Python’s format-string capability with two values, not

just one. Hence, there are two %s symbols in the string and a tuple of values to insert: (offset, dt).

• Finally, we return an HttpResponse of the HTML. By now, this is old hat.

With that view function and URLconf written, start the Django development server (if it’s not al-

ready running), and visit http://127.0.0.1:8000/time/plus/3/ to verify it works. Then try

http://127.0.0.1:8000/time/plus/5/. Then http://127.0.0.1:8000/time/plus/24/. Fi-

nally, visit http://127.0.0.1:8000/time/plus/100/ to verify that the pattern in your URLconf only ac-

cepts one- or two-digit numbers; Django should display a “Page not found” error in this case, just as we saw in the

section “A Quick Note About 404 Errors” earlier. The URL http://127.0.0.1:8000/time/plus/ (with no

hour designation) should also throw a 404.

Coding Order

In this example, we wrote the URLpattern ﬁrst and the view second, but in the previous examples, we wrote the view

ﬁrst, then the URLpattern. Which technique is better?

Well, every developer is different.

If you’re a big-picture type of person, it may make the most sense to you to write all of the URLpatterns for your

application at the same time, at the start of your project, and then code up the views. This has the advantage of giving

you a clear to-do list, and it essentially deﬁnes the parameter requirements for the view functions you’ll need to write.

If you’re more of a bottom-up developer, you might prefer to write the views ﬁrst, and then anchor them to URLs

afterward. That’s OK, too.

In the end, it comes down to which technique ﬁts your brain the best. Both approaches are valid.

32 Chapter 5. Chapter 3: Views and URLconfs

The Django Book, Release 2.0

5.6 Django’s Pretty Error Pages

Take a moment to admire the ﬁne Web application we’ve made so far . . . now let’s break it! Let’s deliberately

introduce a Python error into our views.py ﬁle by commenting out the offset = int(offset) lines in the

hours_ahead view:

def hours_ahead(request, offset):

# try:

# offset = int(offset)

# except ValueError:

# raise Http404()

dt =datetime.datetime.now() +datetime.timedelta(hours=offset)

html ="<html><body>In %s hour(s), it will be %s.</body></html>" %(offset, dt)

return HttpResponse(html)

Load up the development server and navigate to /time/plus/3/. You’ll see an error page with a signiﬁcant

amount of information, including a TypeError message displayed at the very top: "unsupported type for

timedelta hours component: unicode".

What happened? Well, the datetime.timedelta function expects the hours parameter to be an integer, and we

commented out the bit of code that converted offset to an integer. That caused datetime.timedelta to raise

the TypeError. It’s the typical kind of small bug that every programmer runs into at some point.

The point of this example was to demonstrate Django’s error pages. Take some time to explore the error page and get

to know the various bits of information it gives you.

Here are some things to notice:

• At the top of the page, you get the key information about the exception: the type of exception, any parameters to

the exception (the "unsupported type" message in this case), the ﬁle in which the exception was raised,

and the offending line number.

• Under the key exception information, the page displays the full Python traceback for this exception. This is

similar to the standard traceback you get in Python’s command-line interpreter, except it’s more interactive.

For each level (“frame”) in the stack, Django displays the name of the ﬁle, the function/method name, the line

number, and the source code of that line.

Click the line of source code (in dark gray), and you’ll see several lines from before and after the erroneous line,

to give you context.

Click “Local vars” under any frame in the stack to view a table of all local variables and their values, in that

frame, at the exact point in the code at which the exception was raised. This debugging information can be a

great help.

• Note the “Switch to copy-and-paste view” text under the “Traceback” header. Click those words, and the

traceback will switch to a alternate version that can be easily copied and pasted. Use this when you want to

share your exception traceback with others to get technical support – such as the kind folks in the Django IRC

chat room or on the Django users mailing list.

Underneath, the “Share this traceback on a public Web site” button will do this work for you in just one click.

Click it to post the traceback to http://www.dpaste.com/, where you’ll get a distinct URL that you can share with

other people.

• Next, the “Request information” section includes a wealth of information about the incoming Web request that

spawned the error: GET and POST information, cookie values, and meta information, such as CGI headers.

Appendix G has a complete reference of all the information a request object contains.

Below the “Request information” section, the “Settings” section lists all of the settings for this particular Django

installation. (We’ve already mentioned ROOT_URLCONF, and we’ll show you various Django settings through-

out the book. All the available settings are covered in detail in Appendix D.)

5.6. Django’s Pretty Error Pages 33

The Django Book, Release 2.0

The Django error page is capable of displaying more information in certain special cases, such as the case of template

syntax errors. We’ll get to those later, when we discuss the Django template system. For now, uncomment the offset

= int(offset) lines to get the view function working properly again.

Are you the type of programmer who likes to debug with the help of carefully placed print statements? You can use

the Django error page to do so – just without the print statements. At any point in your view, temporarily insert an

assert False to trigger the error page. Then, you can view the local variables and state of the program. Here’s an

example, using the hours_ahead view:

def hours_ahead(request, offset):

try:

offset =int(offset)

except ValueError:

raise Http404()

dt =datetime.datetime.now() +datetime.timedelta(hours=offset)

assert False

html ="<html><body>In %s hour(s), it will be %s.</body></html>" %(offset, dt)

return HttpResponse(html)

Finally, it’s obvious that much of this information is sensitive – it exposes the innards of your Python code and Django

conﬁguration – and it would be foolish to show this information on the public Internet. A malicious person could use

it to attempt to reverse-engineer your Web application and do nasty things. For that reason, the Django error page is

only displayed when your Django project is in debug mode. We’ll explain how to deactivate debug mode in Chapter

12. For now, just know that every Django project is in debug mode automatically when you start it. (Sound familiar?

The “Page not found” errors, described earlier in this chapter, work the same way.)

5.7 What’s next?

So far, we’ve been writing our view functions with HTML hard-coded directly in the Python code. We’ve done that to

keep things simple while we demonstrated core concepts, but in the real world, this is nearly always a bad idea.

Django ships with a simple yet powerful template engine that allows you to separate the design of the page from the

underlying code. We’ll dive into Django’s template engine in the next chapter Chapter 4.

34 Chapter 5. Chapter 3: Views and URLconfs

CHAPTER 6

Chapter 4: Templates

In the previous chapter, you may have noticed something peculiar in how we returned the text in our example views.

Namely, the HTML was hard-coded directly in our Python code, like this:

def current_datetime(request):

now =datetime.datetime.now()

html ="<html><body>It is now %s.</body></html>" %now

return HttpResponse(html)

Although this technique was convenient for the purpose of explaining how views work, it’s not a good idea to hard-

code HTML directly in your views. Here’s why:

• Any change to the design of the page requires a change to the Python code. The design of a site tends to change

far more frequently than the underlying Python code, so it would be convenient if the design could change

without needing to modify the Python code.

• Writing Python code and designing HTML are two different disciplines, and most professional Web develop-

ment environments split these responsibilities between separate people (or even separate departments). Design-

ers and HTML/CSS coders shouldn’t be required to edit Python code to get their job done.

• It’s most efﬁcient if programmers can work on Python code and designers can work on templates at the same

time, rather than one person waiting for the other to ﬁnish editing a single ﬁle that contains both Python and

HTML.

For these reasons, it’s much cleaner and more maintainable to separate the design of the page from the Python code

itself. We can do this with Django’s template system, which we discuss in this chapter.

6.1 Template System Basics

A Django template is a string of text that is intended to separate the presentation of a document from its data. A

template deﬁnes placeholders and various bits of basic logic (template tags) that regulate how the document should be

displayed. Usually, templates are used for producing HTML, but Django templates are equally capable of generating

any text-based format.

Let’s start with a simple example template. This Django template describes an HTML page that thanks a person for

placing an order with a company. Think of it as a form letter:

<html>

<head><title>Ordering notice</title></head>

<body>

<h1>Ordering notice</h1>

The Django Book, Release 2.0

<p>Dear {{ person_name }},</p>

<p>Thanks for placing an order from {{ company }}. It’s scheduled to

ship on {{ ship_date|date:"F j, Y" }}.</p>

<p>Here are the items you’ve ordered:</p>

<ul>

{% for item in item_list %}

{% endfor %}

</ul>

{% if ordered_warranty %}

<p>Your warranty information will be included in the packaging.</p>

{% else %}

<p>You didn’t order a warranty, so you’re on your own when

the products inevitably stop working.</p>

{% endif %}

<p>Sincerely,<br />{{ company }}</p>

</body>

</html>

This template is basic HTML with some variables and template tags thrown in. Let’s step through it:

• Any text surrounded by a pair of braces (e.g., {{ person_name }}) is a variable. This means “insert the

value of the variable with the given name.” (How do we specify the values of the variables? We’ll get to that in

a moment.)

• Any text that’s surrounded by curly braces and percent signs (e.g., {% if ordered_warranty %}) is a

template tag. The deﬁnition of a tag is quite broad: a tag just tells the template system to “do something.”

This example template contains a for tag ({% for item in item_list %}) and an if tag ({% if

ordered_warranty %}).

Afor tag works very much like a for statement in Python, letting you loop over each item in a sequence.

An if tag, as you may expect, acts as a logical “if” statement. In this particular case, the tag checks whether

the value of the ordered_warranty variable evaluates to True. If it does, the template system will display

everything between the {% if ordered_warranty %} and {% else %}. If not, the template system

will display everything between {% else %} and {% endif %}. Note that the {% else %} is optional.

• Finally, the second paragraph of this template contains an example of a ﬁlter, which is the most convenient way

to alter the formatting of a variable. In this example, {{ ship_date|date:"F j, Y" }}, we’re passing

the ship_date variable to the date ﬁlter, giving the date ﬁlter the argument "F j, Y". The date ﬁlter

formats dates in a given format, as speciﬁed by that argument. Filters are attached using a pipe character (|), as

a reference to Unix pipes.

Each Django template has access to several built-in tags and ﬁlters, many of which are discussed in the sections that

follow. Appendix E contains the full list of tags and ﬁlters, and it’s a good idea to familiarize yourself with that list so

you know what’s possible. It’s also possible to create your own ﬁlters and tags; we’ll cover that in Chapter 9.

6.2 Using the Template System

Let’s dive into Django’s template system so you can see how it works – but we’re not yet going to integrate it with the

views that we created in the previous chapter. Our goal here is to show you how the system works independently of

36 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

the rest of Django. (Put another way: usually you’ll be using the template system within a Django view, but we want

to make it clear that the template system is just a Python library that you can use anywhere, not just in Django views.)

Here is the most basic way you can use Django’s template system in Python code:

1. Create a Template object by providing the raw template code as a string.

2. Call the render() method of the Template object with a given set of variables (the context). This returns a

fully rendered template as a string, with all of the variables and template tags evaluated according to the context.

In code, here’s what that looks like:

>>> from django import template

>>> t=template.Template(’My name is {{ name }}.’)

>>> c=template.Context({’name’:’Adrian’})

>>> print t.render(c)

My name is Adrian.

>>> c=template.Context({’name’:’Fred’})

>>> print t.render(c)

My name is Fred.

The following sections describe each step in much more detail.

6.2.1 Creating Template Objects

The easiest way to create a Template object is to instantiate it directly. The Template class lives in the

django.template module, and the constructor takes one argument, the raw template code. Let’s dip into the

Python interactive interpreter to see how this works in code.

From the mysite project directory created by django-admin.py startproject (as covered in Chapter 2),

type python manage.py shell to start the interactive interpreter.

A special Python prompt

If you’ve used Python before, you may be wondering why we’re running python manage.py shell instead

of just python. Both commands will start the interactive interpreter, but the manage.py shell command has

one key difference: before starting the interpreter, it tells Django which settings ﬁle to use. Many parts of Django,

including the template system, rely on your settings, and you won’t be able to use them unless the framework knows

which settings to use.

If you’re curious, here’s how it works behind the scenes. Django looks for an environment variable called

DJANGO_SETTINGS_MODULE, which should be set to the import path of your settings.py. For example,

DJANGO_SETTINGS_MODULE might be set to ’mysite.settings’, assuming mysite is on your Python path.

When you run python manage.py shell, the command takes care of setting DJANGO_SETTINGS_MODULE

for you. We’re encouraging you to use python manage.py shell in these examples so as to minimize the

amount of tweaking and conﬁguring you have to do.

Let’s go through some template system basics:

>>> from django.template import Template

>>> t=Template(’My name is {{ name }}.’)

>>> print t

If you’re following along interactively, you’ll see something like this:

<django.template.Template object at 0xb7d5f24c>

6.2. Using the Template System 37

The Django Book, Release 2.0

That 0xb7d5f24c will be different every time, and it isn’t relevant; it’s a Python thing (the Python “identity” of the

Template object, if you must know).

When you create a Template object, the template system compiles the raw template code into an internal, optimized

form, ready for rendering. But if your template code includes any syntax errors, the call to Template() will cause

aTemplateSyntaxError exception:

>>> from django.template import Template

>>> t=Template(’{% notatag %}’)

Traceback (most recent call last):

File "<stdin>", line 1, in ?

...

django.template.TemplateSyntaxError: Invalid block tag: ’notatag’

The term “block tag” here refers to {% notatag %}. “Block tag” and “template tag” are synonymous.

The system raises a TemplateSyntaxError exception for any of the following cases:

• Invalid tags

• Invalid arguments to valid tags

• Invalid ﬁlters

• Invalid arguments to valid ﬁlters

• Invalid template syntax

• Unclosed tags (for tags that require closing tags)

6.2.2 Rendering a Template

Once you have a Template object, you can pass it data by giving it a context. A context is simply a set of template

variable names and their associated values. A template uses this to populate its variables and evaluate its tags.

A context is represented in Django by the Context class, which lives in the django.template module. Its

constructor takes one optional argument: a dictionary mapping variable names to variable values. Call the Template

object’s render() method with the context to “ﬁll” the template:

>>> from django.template import Context, Template

>>> t=Template(’My name is {{ name }}.’)

>>> c=Context({’name’:’Stephane’})

>>> t.render(c)

u’My name is Stephane.’

One thing we should point out here is that the return value of t.render(c) is a Unicode object – not a normal

Python string. You can tell this by the uin front of the string. Django uses Unicode objects instead of normal strings

throughout the framework. If you understand the repercussions of that, be thankful for the sophisticated things Django

does behind the scenes to make it work. If you don’t understand the repercussions of that, don’t worry for now; just

know that Django’s Unicode support makes it relatively painless for your applications to support a wide variety of

character sets beyond the basic “A-Z” of the English language.

Dictionaries and Contexts

A Python dictionary is a mapping between known keys and variable values. A Context is similar to a dictionary, but

aContext provides additional functionality, as covered in Chapter 9.

Variable names must begin with a letter (A-Z or a-z) and may contain more letters, digits, underscores, and dots. (Dots

are a special case we’ll get to in a moment.) Variable names are case sensitive.

38 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

Here’s an example of template compilation and rendering, using a template similar to the example in the beginning of

this chapter:

>>> from django.template import Template, Context

>>> raw_template ="""<p>Dear {{ person_name }},</p>

...

... <p>Thanks for placing an order from {{ company }}. It’s scheduled to

... ship on {{ ship_date|date:"F j, Y" }}.</p>

...

... {% if ordered_warranty %}

... <p>Your warranty information will be included in the packaging.</p>

... {% else %}

... <p>You didn’t order a warranty, so you’re on your own when

... the products inevitably stop working.</p>

... {% endif %}

...

... <p>Sincerely,<br />{{ company }}</p>"""

>>> t=Template(raw_template)

>>> import datetime

>>> c=Context({’person_name’:’John Smith’,

... ’company’:’Outdoor Equipment’,

... ’ship_date’: datetime.date(2009,4,2),

... ’ordered_warranty’:False})

>>> t.render(c)

u"<p>Dear John Smith,</p>\n\n<p>Thanks for placing an order from Outdoor

Equipment. It’s scheduled to\nship on April 2, 2009.</p>\n\n\n<p>You

didn’t order a warranty, so you’re on your own when\nthe products

inevitably stop working.</p>\n\n\n<p>Sincerely,<br />Outdoor Equipment

</p>"

Let’s step through this code one statement at a time:

• First, we import the classes Template and Context, which both live in the module django.template.

• We save the raw text of our template into the variable raw_template. Note that we use triple quote marks to

designate the string, because it wraps over multiple lines; in contrast, strings within single quote marks cannot

be wrapped over multiple lines.

• Next, we create a template object, t, by passing raw_template to the Template class constructor.

• We import the datetime module from Python’s standard library, because we’ll need it in the following state-

ment.

• Then, we create a Context object, c. The Context constructor takes a Python dictionary, which maps vari-

able names to values. Here, for example, we specify that the person_name is ’John Smith’,company

is ’Outdoor Equipment’, and so forth.

• Finally, we call the render() method on our template object, passing it the context. This returns the rendered

template – i.e., it replaces template variables with the actual values of the variables, and it executes any template

tags.

Note that the “You didn’t order a warranty” paragraph was displayed because the ordered_warranty vari-

able evaluated to False. Also note the date, April 2, 2009, which is displayed according to the format

string ’F j, Y’. (We’ll explain format strings for the date ﬁlter in a little while.)

If you’re new to Python, you may wonder why this output includes newline characters (’\n’) rather than

displaying the line breaks. That’s happening because of a subtlety in the Python interactive interpreter: the call

to t.render(c) returns a string, and by default the interactive interpreter displays the representation of the

string, rather than the printed value of the string. If you want to see the string with line breaks displayed as true

line breaks rather than ’\n’ characters, use the print statement: print t.render(c).

6.2. Using the Template System 39

The Django Book, Release 2.0

Those are the fundamentals of using the Django template system: just write a template string, create a Template

object, create a Context, and call the render() method.

6.2.3 Multiple Contexts, Same Template

Once you have a Template object, you can render multiple contexts through it. For example:

>>> from django.template import Template, Context

>>> t=Template(’Hello, {{ name }}’)

>>> print t.render(Context({’name’:’John’}))

Hello, John

>>> print t.render(Context({’name’:’Julie’}))

Hello, Julie

>>> print t.render(Context({’name’:’Pat’}))

Hello, Pat

Whenever you’re using the same template source to render multiple contexts like this, it’s more efﬁcient to create the

Template object once, and then call render() on it multiple times:

# Bad

for name in (’John’,’Julie’,’Pat’):

t=Template(’Hello, {{ name }}’)

print t.render(Context({’name’: name}))

# Good

t=Template(’Hello, {{ name }}’)

for name in (’John’,’Julie’,’Pat’):

print t.render(Context({’name’: name}))

Django’s template parsing is quite fast. Behind the scenes, most of the parsing happens via a call to a single regular

expression. This is in stark contrast to XML-based template engines, which incur the overhead of an XML parser and

tend to be orders of magnitude slower than Django’s template rendering engine.

6.2.4 Context Variable Lookup

In the examples so far, we’ve passed simple values in the contexts – mostly strings, plus a datetime.date example.

However, the template system elegantly handles more complex data structures, such as lists, dictionaries, and custom

objects.

The key to traversing complex data structures in Django templates is the dot character (.). Use a dot to access

dictionary keys, attributes, methods, or indices of an object.

This is best illustrated with a few examples. For instance, suppose you’re passing a Python dictionary to a template.

To access the values of that dictionary by dictionary key, use a dot:

>>> from django.template import Template, Context

>>> person ={’name’:’Sally’,’age’:’43’}

>>> t=Template(’{{ person.name }} is {{ person.age }} years old.’)

>>> c=Context({’person’: person})

>>> t.render(c)

u’Sally is 43 years old.’

Similarly, dots also allow access of object attributes. For example, a Python datetime.date object has year,

month, and day attributes, and you can use a dot to access those attributes in a Django template:

40 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

>>> from django.template import Template, Context

>>> import datetime

>>> d=datetime.date(1993,5,2)

>>> d.year

1993

>>> d.month

>>> d.day

>>> t=Template(’The month is {{ date.month }} and the year is {{ date.year }}.’)

>>> c=Context({’date’: d})

>>> t.render(c)

u’The month is 5 and the year is 1993.’

This example uses a custom class, demonstrating that variable dots also allow attribute access on arbitrary objects:

>>> from django.template import Template, Context

>>> class Person(object):

... def __init__(self, first_name, last_name):

... self.first_name, self.last_name =first_name, last_name

>>> t=Template(’Hello, {{ person.first_name }} {{ person.last_name }}.’)

>>> c=Context({’person’: Person(’John’,’Smith’)})

>>> t.render(c)

u’Hello, John Smith.’

Dots can also refer to methods on objects. For example, each Python string has the methods upper() and

isdigit(), and you can call those in Django templates using the same dot syntax:

>>> from django.template import Template, Context

>>> t=Template(’{{ var }} -- {{ var.upper }} -- {{ var.isdigit }}’)

>>> t.render(Context({’var’:’hello’}))

u’hello -- HELLO -- False’

>>> t.render(Context({’var’:’123’}))

u’123 -- 123 -- True’

Note that you do not include parentheses in the method calls. Also, it’s not possible to pass arguments to the methods;

you can only call methods that have no required arguments. (We explain this philosophy later in this chapter.)

Finally, dots are also used to access list indices, for example:

>>> from django.template import Template, Context

>>> t=Template(’Item 2 is {{ items.2 }}.’)

>>> c=Context({’items’: [’apples’,’bananas’,’carrots’]})

>>> t.render(c)

u’Item 2 is carrots.’

Negative list indices are not allowed. For example, the template variable {{ items.-1 }} would cause a

TemplateSyntaxError.

Python Lists

A reminder: Python lists have 0-based indices. The ﬁrst item is at index 0, the second is at index 1, and so on.

Dot lookups can be summarized like this: when the template system encounters a dot in a variable name, it tries the

following lookups, in this order:

• Dictionary lookup (e.g., foo["bar"])

• Attribute lookup (e.g., foo.bar)

• Method call (e.g., foo.bar())

6.2. Using the Template System 41

The Django Book, Release 2.0

• List-index lookup (e.g., foo[2])

The system uses the ﬁrst lookup type that works. It’s short-circuit logic.

Dot lookups can be nested multiple levels deep. For instance, the following example uses {{

person.name.upper }}, which translates into a dictionary lookup (person[’name’]) and then a method

call (upper()):

>>> from django.template import Template, Context

>>> person ={’name’:’Sally’,’age’:’43’}

>>> t=Template(’{{ person.name.upper }} is {{ person.age }} years old.’)

>>> c=Context({’person’: person})

>>> t.render(c)

u’SALLY is 43 years old.’

Method Call Behavior

Method calls are slightly more complex than the other lookup types. Here are some things to keep in mind:

• If, during the method lookup, a method raises an exception, the exception will be propagated, unless the ex-

ception has an attribute silent_variable_failure whose value is True. If the exception does have a

silent_variable_failure attribute, the variable will render as an empty string, for example:

>>> t=Template("My name is {{ person.first_name }}.")

>>> class PersonClass3:

... def first_name(self):

... raise AssertionError,"foo"

>>> p=PersonClass3()

>>> t.render(Context({"person": p}))

Traceback (most recent call last):

...

AssertionError: foo

>>> class SilentAssertionError(AssertionError):

... silent_variable_failure =True

>>> class PersonClass4:

... def first_name(self):

... raise SilentAssertionError

>>> p=PersonClass4()

>>> t.render(Context({"person": p}))

u’My name is .’

• A method call will only work if the method has no required arguments. Otherwise, the system will move to the

next lookup type (list-index lookup).

• Obviously, some methods have side effects, and it would be foolish at best, and possibly even a security hole, to

allow the template system to access them.

Say, for instance, you have a BankAccount object that has a delete() method. If a template includes

something like {{ account.delete }}, where account is a BankAccount object, the object would

be deleted when the template is rendered!

To prevent this, set the function attribute alters_data on the method:

def delete(self):

# Delete the account

delete.alters_data = True

42 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

The template system won’t execute any method marked in this way. Continuing the above example, if a template

includes {{ account.delete }} and the delete() method has the alters_data=True, then the

delete() method will not be executed when the template is rendered. Instead, it will fail silently.

How Invalid Variables Are Handled

By default, if a variable doesn’t exist, the template system renders it as an empty string, failing silently. For example:

>>> from django.template import Template, Context

>>> t=Template(’Your name is {{ name }}.’)

>>> t.render(Context())

u’Your name is .’

>>> t.render(Context({’var’:’hello’}))

u’Your name is .’

>>> t.render(Context({’NAME’:’hello’}))

u’Your name is .’

>>> t.render(Context({’Name’:’hello’}))

u’Your name is .’

The system fails silently rather than raising an exception because it’s intended to be resilient to human error. In this

case, all of the lookups failed because variable names have the wrong case or name. In the real world, it’s unacceptable

for a Web site to become inaccessible due to a small template syntax error.

6.2.5 Playing with Context Objects

Most of the time, you’ll instantiate Context objects by passing in a fully populated dictionary to Context(). But

you can add and delete items from a Context object once it’s been instantiated, too, using standard Python dictionary

syntax:

>>> from django.template import Context

>>> c=Context({"foo":"bar"})

>>> c[’foo’]

’bar’

>>> del c[’foo’]

>>> c[’foo’]

Traceback (most recent call last):

...

KeyError: ’foo’

>>> c[’newvariable’]=’hello’

>>> c[’newvariable’]

’hello’

6.3 Basic Template Tags and Filters

As we’ve mentioned already, the template system ships with built-in tags and ﬁlters. The sections that follow provide

a rundown of the most common tags and ﬁlters.

6.3.1 Tags

if/else

The {% if %} tag evaluates a variable, and if that variable is “True” (i.e., it exists, is not empty, and is not a false

Boolean value), the system will display everything between {% if %} and {% endif %}, for example:

6.3. Basic Template Tags and Filters 43

The Django Book, Release 2.0

{% if today_is_weekend %}

<p>Welcome to the weekend!</p>

{% endif %}

An {% else %} tag is optional:

{% if today_is_weekend %}

<p>Welcome to the weekend!</p>

{% else %}

{% endif %}

Python “Truthiness”

In Python and in the Django template system, these objects evaluate to False in a Boolean context:

• An empty list ([])

• An empty tuple (())

• An empty dictionary ({})

• An empty string (’’)

• Zero (0)

• The special object None

• The object False (obviously)

• Custom objects that deﬁne their own Boolean context behavior (this is advanced Python usage)

Everything else evaluates to True.

The {% if %} tag accepts and,or, or not for testing multiple variables, or to negate a given variable. For example:

{% if athlete_list and coach_list %}

Both athletes and coaches are available.

{% endif %}

{% if not athlete_list %}

There are no athletes.

{% endif %}

{% if athlete_list or coach_list %}

There are some athletes or some coaches.

{% endif %}

{% if not athlete_list or coach_list %}

There are no athletes or there are some coaches.

{% endif %}

{% if athlete_list and not coach_list %}

There are some athletes and absolutely no coaches.

{% endif %}

{% if %} tags don’t allow and and or clauses within the same tag, because the order of logic would be ambiguous.

For example, this is invalid:

{% if athlete_list and coach_list or cheerleader_list %}

44 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

The use of parentheses for controlling order of operations is not supported. If you ﬁnd yourself needing parentheses,

consider performing logic outside the template and passing the result of that as a dedicated template variable. Or, just

use nested {% if %} tags, like this:

{% if athlete_list %}

{% if coach_list or cheerleader_list %}

We have athletes, and either coaches or cheerleaders!

{% endif %}

Multiple uses of the same logical operator are ﬁne, but you can’t combine different operators. For example, this is

valid:

{% if athlete_list or coach_list or parent_list or teacher_list %}

There is no {% elif %} tag. Use nested {% if %} tags to accomplish the same thing:

{% if athlete_list %}

<p>Here are the athletes: {{ athlete_list }}.</p>

{% else %}

<p>No athletes are available.</p>

{% if coach_list %}

<p>Here are the coaches: {{ coach_list }}.</p>

{% endif %}

Make sure to close each {% if %} with an {% endif %}. Otherwise, Django will throw a

TemplateSyntaxError.

for

The {% for %} tag allows you to loop over each item in a sequence. As in Python’s for statement, the syntax is

for X in Y, where Yis the sequence to loop over and Xis the name of the variable to use for a particular cycle

of the loop. Each time through the loop, the template system will render everything between {% for %} and {%

endfor %}.

For example, you could use the following to display a list of athletes given a variable athlete_list:

<ul>

{% for athlete in athlete_list %}

<li>{{ athlete.name }}</li>

{% endfor %}

</ul>

Add reversed to the tag to loop over the list in reverse:

{% for athlete in athlete_list reversed %}

...

{% endfor %}

It’s possible to nest {% for %} tags:

{% for athlete in athlete_list %}

<h1>{{ athlete.name }}</h1>

<ul>

{% for sport in athlete.sports_played %}

<li>{{ sport }}</li>

{% endfor %}

6.3. Basic Template Tags and Filters 45

The Django Book, Release 2.0

</ul>

{% endfor %}

A common pattern is to check the size of the list before looping over it, and outputting some special text if the list is

empty:

{% if athlete_list %}

{% for athlete in athlete_list %}

<p>{{ athlete.name }}</p>

{% endfor %}

{% else %}

<p>There are no athletes. Only computer programmers.</p>

{% endif %}

Because this pattern is so common, the for tag supports an optional {% empty %} clause that lets you deﬁne what

to output if the list is empty. This example is equivalent to the previous one:

{% for athlete in athlete_list %}

<p>{{ athlete.name }}</p>

{% empty %}

<p>There are no athletes. Only computer programmers.</p>

{% endfor %}

There is no support for “breaking out” of a loop before the loop is ﬁnished. If you want to accomplish this, change the

variable you’re looping over so that it includes only the values you want to loop over. Similarly, there is no support for

a “continue” statement that would instruct the loop processor to return immediately to the front of the loop. (See the

section “Philosophies and Limitations” later in this chapter for the reasoning behind this design decision.)

Within each {% for %} loop, you get access to a template variable called forloop. This variable has a few

attributes that give you information about the progress of the loop:

•forloop.counter is always set to an integer representing the number of times the loop has been entered.

This is one-indexed, so the ﬁrst time through the loop, forloop.counter will be set to 1. Here’s an example:

{% for item in todo_list %}

<p>{{ forloop.counter }}: {{ item }}</p>

{% endfor %}

•forloop.counter0 is like forloop.counter, except it’s zero-indexed. Its value will be set to 0the

ﬁrst time through the loop.

•forloop.revcounter is always set to an integer representing the number of remaining items in the loop.

The ﬁrst time through the loop, forloop.revcounter will be set to the total number of items in the se-

quence you’re traversing. The last time through the loop, forloop.revcounter will be set to 1.

•forloop.revcounter0 is like forloop.revcounter, except it’s zero-indexed. The ﬁrst time through

the loop, forloop.revcounter0 will be set to the number of elements in the sequence minus 1. The last

time through the loop, it will be set to 0.

•forloop.first is a Boolean value set to True if this is the ﬁrst time through the loop. This is convenient

for special-casing:

{% for object in objects %}

{% if forloop.first %}<li class="first">{% else %}<li>{% endif %}

</li>

{% endfor %}

•forloop.last is a Boolean value set to True if this is the last time through the loop. A common use for

this is to put pipe characters between a list of links:

46 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

{% for link in links %}{{ link }}{% if not forloop.last %} | {% endif %}{% endfor %}

The above template code might output something like this:

Link1 |Link2 |Link3 |Link4

Another common use for this is to put a comma between words in a list:

Favorite places:

{% for p in places %}{{ p }}{% if not forloop.last %}, {% endif %}{% endfor %}

•forloop.parentloop is a reference to the forloop object for the parent loop, in case of nested loops.

Here’s an example:

{% for country in countries %}

<table>

{% for city in country.city_list %}

<tr>

<td>Country #{{ forloop.parentloop.counter }}</td>

<td>City #{{ forloop.counter }}</td>

</tr>

{% endfor %}

</table>

{% endfor %}

The magic forloop variable is only available within loops. After the template parser has reached {% endfor %},

forloop disappears.

Context and the forloop Variable

Inside the {% for %} block, the existing variables are moved out of the way to avoid overwriting the magic

forloop variable. Django exposes this moved context in forloop.parentloop. You generally don’t need

to worry about this, but if you supply a template variable named forloop (though we advise against it), it will be

named forloop.parentloop while inside the {% for %} block.

ifequal/ifnotequal

The Django template system deliberately is not a full-ﬂedged programming language and thus does not allow you to

execute arbitrary Python statements. (More on this idea in the section “Philosophies and Limitations.”) However, it’s

quite a common template requirement to compare two values and display something if they’re equal – and Django

provides an {% ifequal %} tag for that purpose.

The {% ifequal %} tag compares two values and displays everything between {% ifequal %} and {%

endifequal %} if the values are equal.

This example compares the template variables user and currentuser:

{% ifequal user currentuser %}

<h1>Welcome!</h1>

{% endifequal %}

The arguments can be hard-coded strings, with either single or double quotes, so the following is valid:

{% ifequal section ’sitenews’ %}

{% endifequal %}

6.3. Basic Template Tags and Filters 47

The Django Book, Release 2.0

{% ifequal section "community" %}

<h1>Community</h1>

{% endifequal %}

Just like {% if %}, the {% ifequal %} tag supports an optional {% else %}:

{% ifequal section ’sitenews’ %}

{% else %}

{% endifequal %}

Only template variables, strings, integers, and decimal numbers are allowed as arguments to {% ifequal %}.

These are valid examples:

{% ifequal variable 1 %}

{% ifequal variable 1.23 %}

{% ifequal variable ’foo’ %}

{% ifequal variable "foo" %}

Any other types of variables, such as Python dictionaries, lists, or Booleans, can’t be hard-coded in {% ifequal

%}. These are invalid examples:

{% ifequal variable True %}

{% ifequal variable [1, 2, 3] %}

{% ifequal variable {’key’: ’value’} %}

If you need to test whether something is true or false, use the {% if %} tags instead of {% ifequal %}.

Comments

Just as in HTML or Python, the Django template language allows for comments. To designate a comment, use {#

#}:

{# This is a comment #}

The comment will not be output when the template is rendered.

Comments using this syntax cannot span multiple lines. This limitation improves template parsing performance. In

the following template, the rendered output will look exactly the same as the template (i.e., the comment tag will not

be parsed as a comment):

This is a {# this is not

a comment #}

test.

If you want to use multi-line comments, use the {% comment %} template tag, like this:

{% comment %}

This is a

multi-line comment.

{% endcomment %}

6.3.2 Filters

As explained earlier in this chapter, template ﬁlters are simple ways of altering the value of variables before they’re

displayed. Filters use a pipe character, like this:

48 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

This displays the value of the {{ name }} variable after being ﬁltered through the lower ﬁlter, which converts

text to lowercase.

Filters can be chained – that is, they can be used in tandem such that the output of one ﬁlter is applied to the next.

Here’s an example that takes the ﬁrst element in a list and converts it to uppercase:

Some ﬁlters take arguments. A ﬁlter argument comes after a colon and is always in double quotes. For example:

This displays the ﬁrst 30 words of the bio variable.

The following are a few of the most important ﬁlters. Appendix E covers the rest.

•addslashes: Adds a backslash before any backslash, single quote, or double quote. This is useful if the

produced text is included in a JavaScript string.

•date: Formats a date or datetime object according to a format string given in the parameter, for example:

Format strings are deﬁned in Appendix E.

•length: Returns the length of the value. For a list, this returns the number of elements. For a string, this

returns the number of characters. (Python experts, take note that this works on any Python object that knows

how to determine its length – i.e., any object that has a __len__() method.)

6.4 Philosophies and Limitations

Now that you’ve gotten a feel for the Django template language, we should point out some of its intentional limitations,

along with some philosophies behind why it works the way it works.

More than any other component of Web applications, template syntax is highly subjective, and programmers’ opinions

vary wildly. The fact that Python alone has dozens, if not hundreds, of open source template-language implementations

supports this point. Each was likely created because its developer deemed all existing template languages inadequate.

(In fact, it is said to be a rite of passage for a Python developer to write his or her own template language! If you

haven’t done this yet, consider it. It’s a fun exercise.)

With that in mind, you might be interested to know that Django doesn’t require that you use its template language.

Because Django is intended to be a full-stack Web framework that provides all the pieces necessary for Web developers

to be productive, many times it’s more convenient to use Django’s template system than other Python template libraries,

but it’s not a strict requirement in any sense. As you’ll see in the upcoming section “Using Templates in Views”, it’s

very easy to use another template language with Django.

Still, it’s clear we have a strong preference for the way Django’s template language works. The template system has

roots in how Web development is done at World Online and the combined experience of Django’s creators. Here are a

few of those philosophies:

•Business logic should be separated from presentation logic. Django’s developers see a template system as a tool

that controls presentation and presentation-related logic – and that’s it. The template system shouldn’t support

functionality that goes beyond this basic goal.

For that reason, it’s impossible to call Python code directly within Django templates. All “programming” is

fundamentally limited to the scope of what template tags can do. It is possible to write custom template tags that

6.4. Philosophies and Limitations 49

The Django Book, Release 2.0

do arbitrary things, but the out-of-the-box Django template tags intentionally do not allow for arbitrary Python

code execution.

•Syntax should be decoupled from HTML/XML. Although Django’s template system is used primarily to produce

HTML, it’s intended to be just as usable for non-HTML formats, such as plain text. Some other template

languages are XML based, placing all template logic within XML tags or attributes, but Django deliberately

avoids this limitation. Requiring valid XML to write templates introduces a world of human mistakes and hard-

to-understand error messages, and using an XML engine to parse templates incurs an unacceptable level of

overhead in template processing.

•Designers are assumed to be comfortable with HTML code. The template system isn’t designed so that templates

necessarily are displayed nicely in WYSIWYG editors such as Dreamweaver. That is too severe a limitation and

wouldn’t allow the syntax to be as friendly as it is. Django expects template authors to be comfortable editing

HTML directly.

•Designers are assumed not to be Python programmers. The template system authors recognize that Web page

templates are most often written by designers, not programmers, and therefore should not assume Python knowl-

edge.

However, the system also intends to accommodate small teams in which the templates are created by Python

programmers. It offers a way to extend the system’s syntax by writing raw Python code. (More on this in

Chapter 9.)

•The goal is not to invent a programming language. The goal is to offer just enough programming-esque func-

tionality, such as branching and looping, that is essential for making presentation-related decisions.

6.5 Using Templates in Views

You’ve learned the basics of using the template system; now let’s use this knowledge to create a view. Recall the

current_datetime view in mysite.views, which we started in the previous chapter. Here’s what it looks

like:

from django.http import HttpResponse

import datetime

def current_datetime(request):

now =datetime.datetime.now()

html ="<html><body>It is now %s.</body></html>" %now

return HttpResponse(html)

Let’s change this view to use Django’s template system. At ﬁrst, you might think to do something like this:

from django.template import Template, Context

from django.http import HttpResponse

import datetime

def current_datetime(request):

now =datetime.datetime.now()

t=Template("<html><body>It is now {{ current_date }}.</body></html>")

html =t.render(Context({’current_date’: now}))

return HttpResponse(html)

Sure, that uses the template system, but it doesn’t solve the problems we pointed out in the introduction of this chapter.

Namely, the template is still embedded in the Python code, so true separation of data and presentation isn’t achieved.

Let’s ﬁx that by putting the template in a separate ﬁle, which this view will load.

You might ﬁrst consider saving your template somewhere on your ﬁlesystem and using Python’s built-in ﬁle-opening

functionality to read the contents of the template. Here’s what that might look like, assuming the template was saved

50 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

as the ﬁle /home/djangouser/templates/mytemplate.html:

from django.template import Template, Context

from django.http import HttpResponse

import datetime

def current_datetime(request):

now =datetime.datetime.now()

# Simple way of using templates from the filesystem.

# This is BAD because it doesn’t account for missing files!

fp =open(’/home/djangouser/templates/mytemplate.html’)

t=Template(fp.read())

fp.close()

html =t.render(Context({’current_date’: now}))

return HttpResponse(html)

This approach, however, is inelegant for these reasons:

• It doesn’t handle the case of a missing ﬁle. If the ﬁle mytemplate.html doesn’t exist or isn’t readable, the

open() call will raise an IOError exception.

• It hard-codes your template location. If you were to use this technique for every view function, you’d be

duplicating the template locations. Not to mention it involves a lot of typing!

• It includes a lot of boring boilerplate code. You’ve got better things to do than to write calls to open(),

fp.read(), and fp.close() each time you load a template.

To solve these issues, we’ll use template loading and template directories.

6.6 Template Loading

Django provides a convenient and powerful API for loading templates from the ﬁlesystem, with the goal of removing

redundancy both in your template-loading calls and in your templates themselves.

In order to use this template-loading API, ﬁrst you’ll need to tell the framework where you store your templates. The

place to do this is in your settings ﬁle – the settings.py ﬁle that we mentioned last chapter, when we introduced

the ROOT_URLCONF setting.

If you’re following along, open your settings.py and ﬁnd the TEMPLATE_DIRS setting. By default, it’s an

empty tuple, likely containing some auto-generated comments:

TEMPLATE_DIRS =(

# Put strings here, like "/home/html/django_templates" or "C:/www/django/templates".

# Always use forward slashes, even on Windows.

# Don’t forget to use absolute paths, not relative paths.

)

This setting tells Django’s template-loading mechanism where to look for templates. Pick a directory where you’d like

to store your templates and add it to TEMPLATE_DIRS, like so:

TEMPLATE_DIRS =(

’/home/django/mysite/templates’,

)

There are a few things to note:

• You can specify any directory you want, as long as the directory and templates within that directory are read-

able by the user account under which your Web server runs. If you can’t think of an appropriate place to put

6.6. Template Loading 51

The Django Book, Release 2.0

your templates, we recommend creating a templates directory within your project (i.e., within the mysite

directory you created in Chapter 2).

• If your TEMPLATE_DIRS contains only one directory, don’t forget the comma at the end of the directory string!

Bad:

# Missing comma!

TEMPLATE_DIRS =(

’/home/django/mysite/templates’

)

Good:

# Comma correctly in place.

TEMPLATE_DIRS =(

’/home/django/mysite/templates’,

)

The reason for this is that Python requires commas within single-element tuples to disambiguate the tuple from

a parenthetical expression. This is a common newbie gotcha.

• If you’re on Windows, include your drive letter and use Unix-style forward slashes rather than backslashes, as

follows:

TEMPLATE_DIRS =(

’C:/www/django/templates’,

)

• It’s simplest to use absolute paths (i.e., directory paths that start at the root of the ﬁlesystem). If you want to be

a bit more ﬂexible and decoupled, though, you can take advantage of the fact that Django settings ﬁles are just

Python code by constructing the contents of TEMPLATE_DIRS dynamically. For example:

import os.path

TEMPLATE_DIRS =(

os.path.join(os.path.dirname(__file__), ’templates’).replace(’\\’,’/’),

)

This example uses the “magic” Python variable __file__, which is automatically set to the ﬁle name of

the Python module in which the code lives. It gets the name of the directory that contains settings.py

(os.path.dirname), then joins that with templates in a cross-platform way (os.path.join), then

ensures that everything uses forward slashes instead of backslashes (in case of Windows).

While we’re on the topic of dynamic Python code in settings ﬁles, we should point out that it’s very important to

avoid Python errors in your settings ﬁle. If you introduce a syntax error, or a runtime error, your Django-powered

site will likely crash.

With TEMPLATE_DIRS set, the next step is to change the view code to use Django’s template-loading functionality

rather than hard-coding the template paths. Returning to our current_datetime view, let’s change it like so:

from django.template.loader import get_template

from django.template import Context

from django.http import HttpResponse

import datetime

def current_datetime(request):

now =datetime.datetime.now()

t=get_template(’current_datetime.html’)

html =t.render(Context({’current_date’: now}))

return HttpResponse(html)

52 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

In this example, we’re using the function django.template.loader.get_template() rather than loading

the template from the ﬁlesystem manually. The get_template() function takes a template name as its argument,

ﬁgures out where the template lives on the ﬁlesystem, opens that ﬁle, and returns a compiled Template object.

Our template in this example is current_datetime.html, but there’s nothing special about that .html exten-

sion. You can give your templates whatever extension makes sense for your application, or you can leave off extensions

entirely.

To determine the location of the template on your ﬁlesystem, get_template() combines your template direc-

tories from TEMPLATE_DIRS with the template name that you pass to get_template(). For example, if your

TEMPLATE_DIRS is set to ’/home/django/mysite/templates’, the above get_template() call would

look for the template /home/django/mysite/templates/current_datetime.html.

If get_template() cannot ﬁnd the template with the given name, it raises a TemplateDoesNotExist excep-

tion. To see what that looks like, ﬁre up the Django development server again by running python manage.py

runserver within your Django project’s directory. Then, point your browser at the page that activates the

current_datetime view (e.g., http://127.0.0.1:8000/time/). Assuming your DEBUG setting is set

to True and you haven’t yet created a current_datetime.html template, you should see a Django error page

highlighting the TemplateDoesNotExist error.

Figure 6.1: Figure 4-1: The error page shown when a template cannot be found.

This error page is similar to the one we explained in Chapter 3, with one additional piece of debugging information:

a “Template-loader postmortem” section. This section tells you which templates Django tried to load, along with the

reason each attempt failed (e.g., “File does not exist”). This information is invaluable when you’re trying to debug

template-loading errors.

Moving along, create the current_datetime.html ﬁle within your template directory using the following tem-

plate code:

<html><body>It is now {{ current_date }}.</body></html>

Refresh the page in your Web browser, and you should see the fully rendered page.

6.6. Template Loading 53

The Django Book, Release 2.0

6.6.1 render()

We’ve shown you how to load a template, ﬁll a Context and return an HttpResponse object with the result of

the rendered template. We’ve optimized it to use get_template() instead of hard-coding templates and template

paths. But it still requires a fair amount of typing to do those things. Because this is such a common idiom, Django

provides a shortcut that lets you load a template, render it and return an HttpResponse – all in one line of code.

This shortcut is a function called render(), which lives in the module django.shortcuts. Most of the time,

you’ll be using render() rather than loading templates and creating Context and HttpResponse objects man-

ually – unless your employer judges your work by total lines of code written, that is.

Here’s the ongoing current_datetime example rewritten to use render():

from django.shortcuts import render

import datetime

def current_datetime(request):

now =datetime.datetime.now()

return render(request, ’current_datetime.html’, {’current_date’: now})

What a difference! Let’s step through the code changes:

• We no longer have to import get_template,Template,Context, or HttpResponse. Instead, we

import django.shortcuts.render. The import datetime remains.

• Within the current_datetime function, we still calculate now, but the template loading, context creation,

template rendering, and HttpResponse creation are all taken care of by the render() call. Because

render() returns an HttpResponse object, we can simply return that value in the view.

The ﬁrst argument to render() is the request, the second is the name of the template to use. The third argument, if

given, should be a dictionary to use in creating a Context for that template. If you don’t provide a third argument,

render() will use an empty dictionary.

6.6.2 Subdirectories in get_template()

It can get unwieldy to store all of your templates in a single directory. You might like to store templates in subdirecto-

ries of your template directory, and that’s ﬁne. In fact, we recommend doing so; some more advanced Django features

(such as the generic views system, which we cover in Chapter 11) expect this template layout as a default convention.

Storing templates in subdirectories of your template directory is easy. In your calls to get_template(), just

include the subdirectory name and a slash before the template name, like so:

t=get_template(’dateapp/current_datetime.html’)

Because render() is a small wrapper around get_template(), you can do the same thing with the second

argument to render(), like this:

return render(request, ’dateapp/current_datetime.html’, {’current_date’: now})

There’s no limit to the depth of your subdirectory tree. Feel free to use as many subdirectories as you like.

Note: Windows users, be sure to use forward slashes rather than backslashes. get_template() assumes a Unix-

style ﬁle name designation.

54 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

6.6.3 The include Template Tag

Now that we’ve covered the template-loading mechanism, we can introduce a built-in template tag that takes advantage

of it: {% include %}. This tag allows you to include the contents of another template. The argument to the tag

should be the name of the template to include, and the template name can be either a variable or a hard-coded (quoted)

string, in either single or double quotes. Anytime you have the same code in multiple templates, consider using an {%

include %} to remove the duplication.

These two examples include the contents of the template nav.html. The examples are equivalent and illustrate that

either single or double quotes are allowed:

{% include ’nav.html’ %}

{% include "nav.html" %}

This example includes the contents of the template includes/nav.html:

{% include ’includes/nav.html’ %}

This example includes the contents of the template whose name is contained in the variable template_name:

{% include template_name %}

As in get_template(), the ﬁle name of the template is determined by adding the template directory from

TEMPLATE_DIRS to the requested template name.

Included templates are evaluated with the context of the template that’s including them. For example, consider these

two templates:

# mypage.html

<html>

<body>

{% include "includes/nav.html" %}

<h1>{{ title }}</h1>

</body>

</html>

# includes/nav.html

You are in: {{ current_section }}

</div>

If you render mypage.html with a context containing current_section, then the variable will be available in

the “included” template, as you would expect.

If, in an {% include %} tag, a template with the given name isn’t found, Django will do one of two things:

• If DEBUG is set to True, you’ll see the TemplateDoesNotExist exception on a Django error page.

• If DEBUG is set to False, the tag will fail silently, displaying nothing in the place of the tag.

6.7 Template Inheritance

Our template examples so far have been tiny HTML snippets, but in the real world, you’ll be using Django’s template

system to create entire HTML pages. This leads to a common Web development problem: across a Web site, how does

one reduce the duplication and redundancy of common page areas, such as sitewide navigation?

6.7. Template Inheritance 55

The Django Book, Release 2.0

A classic way of solving this problem is to use server-side includes, directives you can embed within your HTML

pages to “include” one Web page inside another. Indeed, Django supports that approach, with the {% include

%} template tag just described. But the preferred way of solving this problem with Django is to use a more elegant

strategy called template inheritance.

In essence, template inheritance lets you build a base “skeleton” template that contains all the common parts of your

site and deﬁnes “blocks” that child templates can override.

Let’s see an example of this by creating a more complete template for our current_datetime view, by editing the

current_datetime.html ﬁle:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">

<head>

<title>The current time</title>

</head>

<body>

<h1>My helpful timestamp site</h1>

<p>It is now {{ current_date }}.</p>

<hr>

<p>Thanks for visiting my site.</p>

</body>

</html>

That looks just ﬁne, but what happens when we want to create a template for another view – say, the hours_ahead

view from Chapter 3? If we want again to make a nice, valid, full HTML template, we’d create something like:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">

<head>

<title>Future time</title>

</head>

<body>

<h1>My helpful timestamp site</h1>

<p>In {{ hour_offset }} hour(s), it will be {{ next_time }}.</p>

<hr>

<p>Thanks for visiting my site.</p>

</body>

</html>

Clearly, we’ve just duplicated a lot of HTML. Imagine if we had a more typical site, including a navigation bar, a few

style sheets, perhaps some JavaScript – we’d end up putting all sorts of redundant HTML into each template.

The server-side include solution to this problem is to factor out the common bits in both templates and save them in

separate template snippets, which are then included in each template. Perhaps you’d store the top bit of the template

in a ﬁle called header.html:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">

<head>

And perhaps you’d store the bottom bit in a ﬁle called footer.html:

<hr>

<p>Thanks for visiting my site.</p>

</body>

</html>

56 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

With an include-based strategy, headers and footers are easy. It’s the middle ground that’s messy. In this example, both

pages feature a title – <h1>My helpful timestamp site</h1> – but that title can’t ﬁt into header.html

because the <title> on both pages is different. If we included the <h1> in the header, we’d have to include the

<title>, which wouldn’t allow us to customize it per page. See where this is going?

Django’s template inheritance system solves these problems. You can think of it as an “inside-out” version of server-

side includes. Instead of deﬁning the snippets that are common, you deﬁne the snippets that are different.

The ﬁrst step is to deﬁne a base template – a skeleton of your page that child templates will later ﬁll in. Here’s a base

template for our ongoing example:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">

<head>

<title>{% block title %}{% endblock %}</title>

</head>

<body>

<h1>My helpful timestamp site</h1>

{% block content %}{% endblock %}

{% block footer %}

<hr>

<p>Thanks for visiting my site.</p>

{% endblock %}

</body>

</html>

This template, which we’ll call base.html, deﬁnes a simple HTML skeleton document that we’ll use for all the

pages on the site. It’s the job of child templates to override, or add to, or leave alone the contents of the blocks. (If

you’re following along, save this ﬁle to your template directory as base.html.)

We’re using a template tag here that you haven’t seen before: the {% block %} tag. All the {% block %} tags

do is tell the template engine that a child template may override those portions of the template.

Now that we have this base template, we can modify our existing current_datetime.html template to use it:

{% extends "base.html" %}

{% block title %}The current time{% endblock %}

{% block content %}

<p>It is now {{ current_date }}.</p>

{% endblock %}

While we’re at it, let’s create a template for the hours_ahead view from Chapter 3. (If you’re following along with

code, we’ll leave it up to you to change hours_ahead to use the template system instead of hard-coded HTML.)

Here’s what that could look like:

{% extends "base.html" %}

{% block title %}Future time{% endblock %}

{% block content %}

<p>In {{ hour_offset }} hour(s), it will be {{ next_time }}.</p>

{% endblock %}

Isn’t this beautiful? Each template contains only the code that’s unique to that template. No redundancy needed. If

you need to make a site-wide design change, just make the change to base.html, and all of the other templates will

immediately reﬂect the change.

Here’s how it works. When you load the template current_datetime.html, the template engine sees the {%

6.7. Template Inheritance 57

The Django Book, Release 2.0

extends %} tag, noting that this template is a child template. The engine immediately loads the parent template –

in this case, base.html.

At that point, the template engine notices the three {% block %} tags in base.html and replaces those blocks

with the contents of the child template. So, the title we’ve deﬁned in {% block title %} will be used, as will

the {% block content %}.

Note that since the child template doesn’t deﬁne the footer block, the template system uses the value from the parent

template instead. Content within a {% block %} tag in a parent template is always used as a fallback.

Inheritance doesn’t affect the template context. In other words, any template in the inheritance tree will have access to

every one of your template variables from the context.

You can use as many levels of inheritance as needed. One common way of using inheritance is the following three-level

approach:

1. Create a base.html template that holds the main look and feel of your site. This is the stuff that rarely, if

ever, changes.

2. Create a base_SECTION.html template for each “section” of your site (e.g., base_photos.html and

base_forum.html). These templates extend base.html and include section-speciﬁc styles/design.

3. Create individual templates for each type of page, such as a forum page or a photo gallery. These templates

extend the appropriate section template.

This approach maximizes code reuse and makes it easy to add items to shared areas, such as section-wide navigation.

Here are some guidelines for working with template inheritance:

• If you use {% extends %} in a template, it must be the ﬁrst template tag in that template. Otherwise,

template inheritance won’t work.

• Generally, the more {% block %} tags in your base templates, the better. Remember, child templates don’t

have to deﬁne all parent blocks, so you can ﬁll in reasonable defaults in a number of blocks, and then deﬁne

only the ones you need in the child templates. It’s better to have more hooks than fewer hooks.

• If you ﬁnd yourself duplicating code in a number of templates, it probably means you should move that code to

a{% block %} in a parent template.

• If you need to get the content of the block from the parent template, use {{ block.super }}, which is a

“magic” variable providing the rendered text of the parent template. This is useful if you want to add to the

contents of a parent block instead of completely overriding it.

• You may not deﬁne multiple {% block %} tags with the same name in the same template. This limitation

exists because a block tag works in “both” directions. That is, a block tag doesn’t just provide a hole to ﬁll, it

also deﬁnes the content that ﬁlls the hole in the parent. If there were two similarly named {% block %} tags

in a template, that template’s parent wouldn’t know which one of the blocks’ content to use.

• The template name you pass to {% extends %} is loaded using the same method that get_template()

uses. That is, the template name is appended to your TEMPLATE_DIRS setting.

• In most cases, the argument to {% extends %} will be a string, but it can also be a variable, if you don’t

know the name of the parent template until runtime. This lets you do some cool, dynamic stuff.

6.8 What’s next?

You now have the basics of Django’s template system under your belt. What’s next?

Many modern Web sites are database-driven: the content of the Web site is stored in a relational database. This allows

a clean separation of data and logic (in the same way views and templates allow the separation of logic and display.)

58 Chapter 6. Chapter 4: Templates

The Django Book, Release 2.0

The next chapter Chapter 5 covers the tools Django gives you to interact with a database.

6.8. What’s next? 59

The Django Book, Release 2.0

60 Chapter 6. Chapter 4: Templates

CHAPTER 7

Chapter 5: Models

In Chapter 3, we covered the fundamentals of building dynamic Web sites with Django: setting up views and URL-

confs. As we explained, a view is responsible for doing some arbitrary logic, and then returning a response. In one of

the examples, our arbitrary logic was to calculate the current date and time.

In modern Web applications, the arbitrary logic often involves interacting with a database. Behind the scenes, a

database-driven Web site connects to a database server, retrieves some data out of it, and displays that data on a Web

page. The site might also provide ways for site visitors to populate the database on their own.

Many complex Web sites provide some combination of the two. Amazon.com, for instance, is a great example of a

database-driven site. Each product page is essentially a query into Amazon’s product database formatted as HTML,

and when you post a customer review, it gets inserted into the database of reviews.

Django is well suited for making database-driven Web sites, because it comes with easy yet powerful tools for per-

forming database queries using Python. This chapter explains that functionality: Django’s database layer.

(Note: While it’s not strictly necessary to know basic relational database theory and SQL in order to use Django’s

database layer, it’s highly recommended. An introduction to those concepts is beyond the scope of this book, but keep

reading even if you’re a database newbie. You’ll probably be able to follow along and grasp concepts based on the

context.)

7.1 The “Dumb” Way to Do Database Queries in Views

Just as Chapter 3 detailed a “dumb” way to produce output within a view (by hard-coding the text directly within the

view), there’s a “dumb” way to retrieve data from a database in a view. It’s simple: just use any existing Python library

to execute an SQL query and do something with the results.

In this example view, we use the MySQLdb library (available via http://www.djangoproject.com/r/python-mysql/) to

connect to a MySQL database, retrieve some records, and feed them to a template for display as a Web page:

from django.shortcuts import render

import MySQLdb

def book_list(request):

db =MySQLdb.connect(user=’me’, db=’mydb’, passwd=’secret’, host=’localhost’)

cursor =db.cursor()

cursor.execute(’SELECT name FROM books ORDER BY name’)

names =[row[0]for row in cursor.fetchall()]

db.close()

return render(request, ’book_list.html’, {’names’: names})

This approach works, but some problems should jump out at you immediately:

The Django Book, Release 2.0

• We’re hard-coding the database connection parameters. Ideally, these parameters would be stored in the Django

conﬁguration.

• We’re having to write a fair bit of boilerplate code: creating a connection, creating a cursor, executing a state-

ment, and closing the connection. Ideally, all we’d have to do is specify which results we wanted.

• It ties us to MySQL. If, down the road, we switch from MySQL to PostgreSQL, we’ll have to use a different

database adapter (e.g., psycopg rather than MySQLdb), alter the connection parameters, and – depending on

the nature of the SQL statement – possibly rewrite the SQL. Ideally, the database server we’re using would be

abstracted, so that a database server change could be made in a single place. (This feature is particularly relevant

if you’re building an open-source Django application that you want to be used by as many people as possible.)

As you might expect, Django’s database layer aims to solve these problems. Here’s a sneak preview of how the

previous view can be rewritten using Django’s database API:

from django.shortcuts import render

from mysite.books.models import Book

def book_list(request):

books =Book.objects.order_by(’name’)

return render(request, ’book_list.html’, {’books’: books})

We’ll explain this code a little later in the chapter. For now, just get a feel for how it looks.

7.2 The MTV (or MVC) Development Pattern

Before we delve into any more code, let’s take a moment to consider the overall design of a database-driven Django

Web application.

As we mentioned in previous chapters, Django is designed to encourage loose coupling and strict separation between

pieces of an application. If you follow this philosophy, it’s easy to make changes to one particular piece of the appli-

cation without affecting the other pieces. In view functions, for instance, we discussed the importance of separating

the business logic from the presentation logic by using a template system. With the database layer, we’re applying that

same philosophy to data access logic.

Those three pieces together – data access logic, business logic, and presentation logic – comprise a concept that’s

sometimes called the Model-View-Controller (MVC) pattern of software architecture. In this pattern, “Model” refers

to the data access layer, “View” refers to the part of the system that selects what to display and how to display it, and

“Controller” refers to the part of the system that decides which view to use, depending on user input, accessing the

model as needed.

Why the Acronym?

The goal of explicitly deﬁning patterns such as MVC is mostly to streamline communication among developers.

Instead of having to tell your coworkers, “Let’s make an abstraction of the data access, then let’s have a separate layer

that handles data display, and let’s put a layer in the middle that regulates this,” you can take advantage of a shared

vocabulary and say, “Let’s use the MVC pattern here.”

Django follows this MVC pattern closely enough that it can be called an MVC framework. Here’s roughly how the

M, V, and C break down in Django:

•M, the data-access portion, is handled by Django’s database layer, which is described in this chapter.

•V, the portion that selects which data to display and how to display it, is handled by views and templates.

•C, the portion that delegates to a view depending on user input, is handled by the framework itself by following

your URLconf and calling the appropriate Python function for the given URL.

62 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

Because the “C” is handled by the framework itself and most of the excitement in Django happens in models, templates

and views, Django has been referred to as an MTV framework. In the MTV development pattern,

•Mstands for “Model,” the data access layer. This layer contains anything and everything about the data: how to

access it, how to validate it, which behaviors it has, and the relationships between the data.

•Tstands for “Template,” the presentation layer. This layer contains presentation-related decisions: how some-

thing should be displayed on a Web page or other type of document.

•Vstands for “View,” the business logic layer. This layer contains the logic that access the model and defers to

the appropriate template(s). You can think of it as the bridge between models and templates.

If you’re familiar with other MVC Web-development frameworks, such as Ruby on Rails, you may consider Django

views to be the “controllers” and Django templates to be the “views.” This is an unfortunate confusion brought

about by differing interpretations of MVC. In Django’s interpretation of MVC, the “view” describes the data that gets

presented to the user; it’s not necessarily just how the data looks, but which data is presented. In contrast, Ruby on

Rails and similar frameworks suggest that the controller’s job includes deciding which data gets presented to the user,

whereas the view is strictly how the data looks, not which data is presented.

Neither interpretation is more “correct” than the other. The important thing is to understand the underlying concepts.

7.3 Conﬁguring the Database

With all of that philosophy in mind, let’s start exploring Django’s database layer. First, we need to take care of some

initial conﬁguration; we need to tell Django which database server to use and how to connect to it.

We’ll assume you’ve set up a database server, activated it, and created a database within it (e.g., using a CREATE

DATABASE statement). If you’re using SQLite, no such setup is required, because SQLite uses standalone ﬁles on the

ﬁlesystem to store its data.

As with TEMPLATE_DIRS in the previous chapter, database conﬁguration lives in the Django settings ﬁle, called

settings.py by default. Edit that ﬁle and look for the database settings:

DATABASES ={

’default’: {

’ENGINE’:’django.db.backends.’,# Add ’postgresql_psycopg2’, ’mysql’, ’sqlite3’ or ’oracle’.

’NAME’:’’,# Or path to database file if using sqlite3.

’USER’:’’,# Not used with sqlite3.

’PASSWORD’:’’,# Not used with sqlite3.

’HOST’:’’,# Set to empty string for localhost. Not used with sqlite3.

’PORT’:’’,# Set to empty string for default. Not used with sqlite3.

}

Here’s a rundown of each setting.

•ENGINE tells Django which database engine to use. If you’re using a database with Django, ENGINE must be

set to one of the strings shown in Table 5-1.

7.3. Conﬁguring the Database 63

The Django Book, Release 2.0

Table 7.1: Table 5-1. Database Engine Settings

Setting Database Required Adapter

django.db.backends.postgresql_psycopg2Post-

greSQL

psycopg version 2.x,

http://www.djangoproject.com/r/python-pgsql/.

django.db.backends.mysql MySQL MySQLdb,

http://www.djangoproject.com/r/python-mysql/.

django.db.backends.sqlite3 SQLite No adapter needed.

django.db.backends.oracle Oracle cx_Oracle,

http://www.djangoproject.com/r/python-oracle/.

Note that for whichever database back-end you use, you’ll need to download and install the appropriate database

adapter. Each one is available for free on the Web; just follow the links in the “Required Adapter” column in

Table 5-1. If you’re on Linux, your distribution’s package-management system might offer convenient packages.

(Look for packages called python-postgresql or python-psycopg, for example.)

Example:

’ENGINE’: ’django.db.backends.postgresql_psycopg2’,

•NAME tells Django the name of your database. For example:

’NAME’: ’mydb’,

If you’re using SQLite, specify the full ﬁlesystem path to the database ﬁle on your ﬁlesystem. For example:

’NAME’: ’/home/django/mydata.db’,

As for where you put that SQLite database, we’re using the /home/django directory in this example, but you

should pick a directory that works best for you.

•USER tells Django which username to use when connecting to your database. For example: If you’re using

SQLite, leave this blank.

•PASSWORD tells Django which password to use when connecting to your database. If you’re using SQLite or

have an empty password, leave this blank.

•HOST tells Django which host to use when connecting to your database. If your database is on the same

computer as your Django installation (i.e., localhost), leave this blank. If you’re using SQLite, leave this blank.

MySQL is a special case here. If this value starts with a forward slash (’/’) and you’re using MySQL, MySQL

will connect via a Unix socket to the speciﬁed socket, for example:

’HOST’: ’/var/run/mysql’,

•PORT tells Django which port to use when connecting to your database. If you’re using SQLite, leave this blank.

Otherwise, if you leave this blank, the underlying database adapter will use whichever port is default for your

given database server. In most cases, the default port is ﬁne, so you can leave this blank.

Once you’ve entered those settings and saved settings.py, it’s a good idea to test your conﬁguration. To do this,

run python manage.py shell as in the last chapter, from within the mysite project directory. (As we pointed

out last chapter manage.py shell is a way to run the Python interpreter with the correct Django settings activated.

This is necessary in our case, because Django needs to know which settings ﬁle to use in order to get your database

connection information.)

In the shell, type these commands to test your database conﬁguration:

>>> from django.db import connection

>>> cursor =connection.cursor()

64 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

If nothing happens, then your database is conﬁgured properly. Otherwise, check the error message for clues about

what’s wrong. Table 5-2 shows some common errors.

Table 7.2: Table 5-2. Database Conﬁguration Error Messages

Error Message Solution

You haven’t set the ENGINE

setting yet.

Set the ENGINE setting to something other than an empty string. Valid values

are in Table 5-1.

Environment variable

DJANGO_SETTINGS_MODULE

is undeﬁned.

Run the command python manage.py shell rather than python.

Error loading _____ module:

No module named _____.

You haven’t installed the appropriate database-speciﬁc adapter (e.g.,

psycopg or MySQLdb). Adapters are not bundled with Django, so it’s your

responsibility to download and install them on your own.

_____ isn’t an available

database backend.

Set your ENGINE setting to one of the valid engine settings described

previously. Perhaps you made a typo?

database _____ does not exist Change the NAME setting to point to a database that exists, or execute the

appropriate CREATE DATABASE statement in order to create it.

role _____ does not exist Change the USER setting to point to a user that exists, or create the user in

your database.

could not connect to server Make sure HOST and PORT are set correctly, and make sure the database

server is running.

7.4 Your First App

Now that you’ve veriﬁed the connection is working, it’s time to create a Django app – a bundle of Django code,

including models and views, that lives together in a single Python package and represents a full Django application.

It’s worth explaining the terminology here, because this tends to trip up beginners. We’d already created a project, in

Chapter 2, so what’s the difference between a project and an app? The difference is that of conﬁguration vs. code:

• A project is an instance of a certain set of Django apps, plus the conﬁguration for those apps.

Technically, the only requirement of a project is that it supplies a settings ﬁle, which deﬁnes the database

connection information, the list of installed apps, the TEMPLATE_DIRS, and so forth.

• An app is a portable set of Django functionality, usually including models and views, that lives together in a

single Python package.

For example, Django comes with a number of apps, such as a commenting system and an automatic admin

interface. A key thing to note about these apps is that they’re portable and reusable across multiple projects.

There are very few hard-and-fast rules about how you ﬁt your Django code into this scheme. If you’re building a

simple Web site, you may use only a single app. If you’re building a complex Web site with several unrelated pieces

such as an e-commerce system and a message board, you’ll probably want to split those into separate apps so that

you’ll be able to reuse them individually in the future.

Indeed, you don’t necessarily need to create apps at all, as evidenced by the example view functions we’ve created so

far in this book. In those cases, we simply created a ﬁle called views.py, ﬁlled it with view functions, and pointed

our URLconf at those functions. No “apps” were needed.

However, there’s one requirement regarding the app convention: if you’re using Django’s database layer (models), you

must create a Django app. Models must live within apps. Thus, in order to start writing our models, we’ll need to

create a new app.

Within the mysite project directory, type this command to create a books app:

7.4. Your First App 65

The Django Book, Release 2.0

python manage.py startapp books

This command does not produce any output, but it does create a books directory within the mysite directory. Let’s

look at the contents of that directory:

books/

__init__.py

models.py

tests.py

views.py

These ﬁles will contain the models and views for this app.

Have a look at models.py and views.py in your favorite text editor. Both ﬁles are empty, except for comments

and an import in models.py. This is the blank slate for your Django app.

7.5 Deﬁning Models in Python

As we discussed earlier in this chapter, the “M” in “MTV” stands for “Model.” A Django model is a description of the

data in your database, represented as Python code. It’s your data layout – the equivalent of your SQL CREATE TABLE

statements – except it’s in Python instead of SQL, and it includes more than just database column deﬁnitions. Django

uses a model to execute SQL code behind the scenes and return convenient Python data structures representing the

rows in your database tables. Django also uses models to represent higher-level concepts that SQL can’t necessarily

handle.

If you’re familiar with databases, your immediate thought might be, “Isn’t it redundant to deﬁne data models in Python

instead of in SQL?” Django works the way it does for several reasons:

• Introspection requires overhead and is imperfect. In order to provide convenient data-access APIs, Django needs

to know the database layout somehow, and there are two ways of accomplishing this. The ﬁrst way would be

to explicitly describe the data in Python, and the second way would be to introspect the database at runtime to

determine the data models.

This second way seems cleaner, because the metadata about your tables lives in only one place, but it introduces

a few problems. First, introspecting a database at runtime obviously requires overhead. If the framework had

to introspect the database each time it processed a request, or even only when the Web server was initialized,

this would incur an unacceptable level of overhead. (While some believe that level of overhead is acceptable,

Django’s developers aim to trim as much framework overhead as possible.) Second, some databases, notably

older versions of MySQL, do not store sufﬁcient metadata for accurate and complete introspection.

• Writing Python is fun, and keeping everything in Python limits the number of times your brain has to do a

“context switch.” It helps productivity if you keep yourself in a single programming environment/mentality for

as long as possible. Having to write SQL, then Python, and then SQL again is disruptive.

• Having data models stored as code rather than in your database makes it easier to keep your models under

version control. This way, you can easily keep track of changes to your data layouts.

• SQL allows for only a certain level of metadata about a data layout. Most database systems, for example, do not

provide a specialized data type for representing email addresses or URLs. Django models do. The advantage of

higher-level data types is higher productivity and more reusable code.

• SQL is inconsistent across database platforms. If you’re distributing a Web application, for example, it’s much

more pragmatic to distribute a Python module that describes your data layout than separate sets of CREATE

TABLE statements for MySQL, PostgreSQL, and SQLite.

A drawback of this approach, however, is that it’s possible for the Python code to get out of sync with what’s actually

in the database. If you make changes to a Django model, you’ll need to make the same changes inside your database

66 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

to keep your database consistent with the model. We’ll discuss some strategies for handling this problem later in this

chapter.

Finally, we should note that Django includes a utility that can generate models by introspecting an existing database.

This is useful for quickly getting up and running with legacy data. We’ll cover this in Chapter 18.

7.6 Your First Model

As an ongoing example in this chapter and the next chapter, we’ll focus on a basic book/author/publisher data layout.

We use this as our example because the conceptual relationships between books, authors, and publishers are well

known, and this is a common data layout used in introductory SQL textbooks. You’re also reading a book that was

written by authors and produced by a publisher!

We’ll suppose the following concepts, ﬁelds, and relationships:

• An author has a ﬁrst name, a last name and an email address.

• A publisher has a name, a street address, a city, a state/province, a country, and a Web site.

• A book has a title and a publication date. It also has one or more authors (a many-to-many relationship with

authors) and a single publisher (a one-to-many relationship – aka foreign key – to publishers).

The ﬁrst step in using this database layout with Django is to express it as Python code. In the models.py ﬁle that

was created by the startapp command, enter the following:

from django.db import models

class Publisher(models.Model):

name =models.CharField(max_length=30)

address =models.CharField(max_length=50)

city =models.CharField(max_length=60)

state_province =models.CharField(max_length=30)

country =models.CharField(max_length=50)

website =models.URLField()

class Author(models.Model):

first_name =models.CharField(max_length=30)

last_name =models.CharField(max_length=40)

email =models.EmailField()

class Book(models.Model):

title =models.CharField(max_length=100)

authors =models.ManyToManyField(Author)

publisher =models.ForeignKey(Publisher)

publication_date =models.DateField()

Let’s quickly examine this code to cover the basics. The ﬁrst thing to notice is that each model is represented by a

Python class that is a subclass of django.db.models.Model. The parent class, Model, contains all the machin-

ery necessary to make these objects capable of interacting with a database – and that leaves our models responsible

solely for deﬁning their ﬁelds, in a nice and compact syntax. Believe it or not, this is all the code we need to write to

have basic data access with Django.

Each model generally corresponds to a single database table, and each attribute on a model generally corresponds to

a column in that database table. The attribute name corresponds to the column’s name, and the type of ﬁeld (e.g.,

CharField) corresponds to the database column type (e.g., varchar). For example, the Publisher model is

equivalent to the following table (assuming PostgreSQL CREATE TABLE syntax):

7.6. Your First Model 67

The Django Book, Release 2.0

CREATE TABLE "books_publisher" (

"id" serial NOT NULL PRIMARY KEY,

"name" varchar(30) NOT NULL,

"address" varchar(50) NOT NULL,

"city" varchar(60) NOT NULL,

"state_province" varchar(30) NOT NULL,

"country" varchar(50) NOT NULL,

"website" varchar(200) NOT NULL

);

Indeed, Django can generate that CREATE TABLE statement automatically, as we’ll show you in a moment.

The exception to the one-class-per-database-table rule is the case of many-to-many relationships. In our example

models, Book has a ManyToManyField called authors. This designates that a book has one or many authors,

but the Book database table doesn’t get an authors column. Rather, Django creates an additional table – a many-

to-many “join table” – that handles the mapping of books to authors.

For a full list of ﬁeld types and model syntax options, see Appendix B.

Finally, note we haven’t explicitly deﬁned a primary key in any of these models. Unless you instruct it otherwise,

Django automatically gives every model an auto-incrementing integer primary key ﬁeld called id. Each Django

model is required to have a single-column primary key.

7.7 Installing the Model

We’ve written the code; now let’s create the tables in our database. In order to do that, the ﬁrst step is to activate these

models in our Django project. We do that by adding the books app to the list of “installed apps” in the settings ﬁle.

Edit the settings.py ﬁle again, and look for the INSTALLED_APPS setting. INSTALLED_APPS tells Django

which apps are activated for a given project. By default, it looks something like this:

INSTALLED_APPS =(

’django.contrib.auth’,

’django.contrib.contenttypes’,

’django.contrib.sessions’,

’django.contrib.sites’,

’django.contrib.messages’,

’django.contrib.staticfiles’,

)

Temporarily comment out all six of those strings by putting a hash character (#) in front of them. (They’re included

by default as a common-case convenience, but we’ll activate and discuss them in subsequent chapters.) While you’re

at it, comment out the default MIDDLEWARE_CLASSES setting, too; the default values in MIDDLEWARE_CLASSES

depend on some of the apps we just commented out. Then, add ’books’ to the INSTALLED_APPS list, so the

setting ends up looking like this:

MIDDLEWARE_CLASSES =(

# ’django.middleware.common.CommonMiddleware’,

# ’django.contrib.sessions.middleware.SessionMiddleware’,

# ’django.middleware.csrf.CsrfViewMiddleware’,

# ’django.contrib.auth.middleware.AuthenticationMiddleware’,

# ’django.contrib.messages.middleware.MessageMiddleware’,

)

INSTALLED_APPS =(

# ’django.contrib.auth’,

# ’django.contrib.contenttypes’,

68 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

# ’django.contrib.sessions’,

# ’django.contrib.sites’,

’books’,

)

(As we mentioned last chapter when setting TEMPLATE_DIRS, you’ll need to be sure to include the trailing comma

in INSTALLED_APPS, because it’s a single-element tuple. By the way, this book’s authors prefer to put a comma

after every element of a tuple, regardless of whether the tuple has only a single element. This avoids the issue of

forgetting commas, and there’s no penalty for using that extra comma.)

’mysite.books’ refers to the books app we’re working on. Each app in INSTALLED_APPS is represented by

its full Python path – that is, the path of packages, separated by dots, leading to the app package.

Now that the Django app has been activated in the settings ﬁle, we can create the database tables in our database. First,

let’s validate the models by running this command:

python manage.py validate

The validate command checks whether your models’ syntax and logic are correct. If all is well, you’ll see the

message 0 errors found. If you don’t, make sure you typed in the model code correctly. The error output should

give you helpful information about what was wrong with the code.

Any time you think you have problems with your models, run python manage.py validate. It tends to catch

all the common model problems.

If your models are valid, run the following command for Django to generate CREATE TABLE statements for your

models in the books app (with colorful syntax highlighting available, if you’re using Unix):

python manage.py sqlall books

In this command, books is the name of the app. It’s what you speciﬁed when you ran the command manage.py

startapp. When you run the command, you should see something like this:

BEGIN;

CREATE TABLE "books_publisher" (

"id" serial NOT NULL PRIMARY KEY,

"name" varchar(30) NOT NULL,

"address" varchar(50) NOT NULL,

"city" varchar(60) NOT NULL,

"state_province" varchar(30) NOT NULL,

"country" varchar(50) NOT NULL,

"website" varchar(200) NOT NULL

)

;

CREATE TABLE "books_author" (

"id" serial NOT NULL PRIMARY KEY,

"first_name" varchar(30) NOT NULL,

"last_name" varchar(40) NOT NULL,

"email" varchar(75) NOT NULL

)

;

CREATE TABLE "books_book" (

"id" serial NOT NULL PRIMARY KEY,

"title" varchar(100) NOT NULL,

"publisher_id" integer NOT NULL REFERENCES "books_publisher" ("id") DEFERRABLE INITIALLY DEFERRED,

"publication_date" date NOT NULL

)

;

CREATE TABLE "books_book_authors" (

"id" serial NOT NULL PRIMARY KEY,

7.7. Installing the Model 69

The Django Book, Release 2.0

"book_id" integer NOT NULL REFERENCES "books_book" ("id") DEFERRABLE INITIALLY DEFERRED,

"author_id" integer NOT NULL REFERENCES "books_author" ("id") DEFERRABLE INITIALLY DEFERRED,

UNIQUE ("book_id", "author_id")

)

;

CREATE INDEX "books_book_publisher_id" ON "books_book" ("publisher_id");

COMMIT;

Note the following:

• Table names are automatically generated by combining the name of the app (books) and the lowercase name

of the model (publisher,book, and author). You can override this behavior, as detailed in Appendix B.

• As we mentioned earlier, Django adds a primary key for each table automatically – the id ﬁelds. You can

override this, too.

• By convention, Django appends "_id" to the foreign key ﬁeld name. As you might have guessed, you can

override this behavior, too.

• The foreign key relationship is made explicit by a REFERENCES statement.

• These CREATE TABLE statements are tailored to the database you’re using, so database-speciﬁc ﬁeld types

such as auto_increment (MySQL), serial (PostgreSQL), or integer primary key (SQLite) are

handled for you automatically. The same goes for quoting of column names (e.g., using double quotes or single

quotes). This example output is in PostgreSQL syntax.

The sqlall command doesn’t actually create the tables or otherwise touch your database – it just prints output to the

screen so you can see what SQL Django would execute if you asked it. If you wanted to, you could copy and paste this

SQL into your database client, or use Unix pipes to pass it directly (e.g., python manage.py sqlall books

| psql mydb). However, Django provides an easier way of committing the SQL to the database: the syncdb

command:

python manage.py syncdb

Run that command, and you’ll see something like this:

Creating table books_publisher

Creating table books_author

Creating table books_book

Installing index for books.Book model

The syncdb command is a simple “sync” of your models to your database. It looks at all of the models in each app in

your INSTALLED_APPS setting, checks the database to see whether the appropriate tables exist yet, and creates the

tables if they don’t yet exist. Note that syncdb does not sync changes in models or deletions of models; if you make

a change to a model or delete a model, and you want to update the database, syncdb will not handle that. (More on

this in the “Making Changes to a Database Schema” section toward the end of this chapter.)

If you run python manage.py syncdb again, nothing happens, because you haven’t added any models to the

books app or added any apps to INSTALLED_APPS. Ergo, it’s always safe to run python manage.py syncdb

– it won’t clobber things.

If you’re interested, take a moment to dive into your database server’s command-line client and see the database tables

Django created. You can manually run the command-line client (e.g., psql for PostgreSQL) or you can run the

command python manage.py dbshell, which will ﬁgure out which command-line client to run, depending on

your DATABASE_SERVER setting. The latter is almost always more convenient.

70 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

7.8 Basic Data Access

Once you’ve created a model, Django automatically provides a high-level Python API for working with those models.

Try it out by running python manage.py shell and typing the following:

>>> from books.models import Publisher

>>> p1 =Publisher(name=’Apress’, address=’2855 Telegraph Avenue’,

... city=’Berkeley’, state_province=’CA’, country=’U.S.A.’,

... website=’http://www.apress.com/’)

>>> p1.save()

>>> p2 =Publisher(name="O’Reilly", address=’10 Fawcett St.’,

... city=’Cambridge’, state_province=’MA’, country=’U.S.A.’,

... website=’http://www.oreilly.com/’)

>>> p2.save()

>>> publisher_list =Publisher.objects.all()

>>> publisher_list

[<Publisher: Publisher object>, <Publisher: Publisher object>]

These few lines of code accomplish quite a bit. Here are the highlights:

• First, we import our Publisher model class. This lets us interact with the database table that contains pub-

lishers.

• We create a Publisher object by instantiating it with values for each ﬁeld – name,address, etc.

• To save the object to the database, call its save() method. Behind the scenes, Django executes an SQL

INSERT statement here.

• To retrieve publishers from the database, use the attribute Publisher.objects, which you can think

of as a set of all publishers. Fetch a list of all Publisher objects in the database with the statement

Publisher.objects.all(). Behind the scenes, Django executes an SQL SELECT statement here.

One thing is worth mentioning, in case it wasn’t clear from this example. When you’re creating objects using the

Django model API, Django doesn’t save the objects to the database until you call the save() method:

p1 =Publisher(...)

# At this point, p1 is not saved to the database yet!

p1.save()

# Now it is.

If you want to create an object and save it to the database in a single step, use the objects.create() method.

This example is equivalent to the example above:

>>> p1 =Publisher.objects.create(name=’Apress’,

... address=’2855 Telegraph Avenue’,

... city=’Berkeley’, state_province=’CA’, country=’U.S.A.’,

... website=’http://www.apress.com/’)

>>> p2 =Publisher.objects.create(name="O’Reilly",

... address=’10 Fawcett St.’, city=’Cambridge’,

... state_province=’MA’, country=’U.S.A.’,

... website=’http://www.oreilly.com/’)

>>> publisher_list =Publisher.objects.all()

>>> publisher_list

Naturally, you can do quite a lot with the Django database API – but ﬁrst, let’s take care of a small annoyance.

7.8. Basic Data Access 71

The Django Book, Release 2.0

7.9 Adding Model String Representations

When we printed out the list of publishers, all we got was this unhelpful display that makes it difﬁcult to tell the

Publisher objects apart:

[<Publisher: Publisher object>, <Publisher: Publisher object>]

We can ﬁx this easily by adding a method called __unicode__() to our Publisher class. A __unicode__()

method tells Python how to display the “unicode” representation of an object. You can see this in action by adding a

__unicode__() method to the three models:

from django.db import models

class Publisher(models.Model):

name = models.CharField(max_length=30)

address = models.CharField(max_length=50)

city = models.CharField(max_length=60)

state_province = models.CharField(max_length=30)

country = models.CharField(max_length=50)

website = models.URLField()

def __unicode__(self):

return self.name

class Author(models.Model):

first_name = models.CharField(max_length=30)

last_name = models.CharField(max_length=40)

email = models.EmailField()

def __unicode__(self):

return u’%s %s’ % (self.first_name, self.last_name)

class Book(models.Model):

title = models.CharField(max_length=100)

authors = models.ManyToManyField(Author)

publisher = models.ForeignKey(Publisher)

publication_date = models.DateField()

def __unicode__(self):

return self.title

As you can see, a __unicode__() method can do whatever it needs to do in order to return a representation of an

object. Here, the __unicode__() methods for Publisher and Book simply return the object’s name and title,

respectively, but the __unicode__() for Author is slightly more complex – it pieces together the first_name

and last_name ﬁelds, separated by a space.

The only requirement for __unicode__() is that it return a Unicode object. If __unicode__() doesn’t return a

Unicode object – if it returns, say, an integer – then Python will raise a TypeError with a message like "coercing

to Unicode: need string or buffer, int found".

Unicode objects

What are Unicode objects?

You can think of a Unicode object as a Python string that can handle more than a million different types of characters,

from accented versions of Latin characters to non-Latin characters to curly quotes and obscure symbols.

72 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

Normal Python strings are encoded, which means they use an encoding such as ASCII, ISO-8859-1 or UTF-8. If

you’re storing fancy characters (anything beyond the standard 128 ASCII characters such as 0-9 and A-Z) in a normal

Python string, you have to keep track of which encoding your string is using, or the fancy characters might appear

messed up when they’re displayed or printed. Problems occur when you have data that’s stored in one encoding and

you try to combine it with data in a different encoding, or you try to display it in an application that assumes a certain

encoding. We’ve all seen Web pages and e-mails that are littered with ”??? ??????” or other characters in odd places;

that generally suggests there’s an encoding problem.

Unicode objects, however, have no encoding; they use a consistent, universal set of characters called, well, “Unicode.”

When you deal with Unicode objects in Python, you can mix and match them safely without having to worry about

encoding issues.

Django uses Unicode objects throughout the framework. Model objects are retrieved as Unicode objects, views interact

with Unicode data, and templates are rendered as Unicode. Generally, you won’t have to worry about making sure

your encodings are right; things should just work.

Note that this has been a very high-level, dumbed down overview of Unicode objects, and you owe it to yourself to

learn more about the topic. A good place to start is http://www.joelonsoftware.com/articles/Unicode.html .

For the __unicode__() changes to take effect, exit out of the Python shell and enter it again with python

manage.py shell. (This is the simplest way to make code changes take effect.) Now the list of Publisher

objects is much easier to understand:

>>> from books.models import Publisher

>>> publisher_list =Publisher.objects.all()

>>> publisher_list

[<Publisher: Apress>, <Publisher: O’Reilly>]

Make sure any model you deﬁne has a __unicode__() method – not only for your own convenience when using

the interactive interpreter, but also because Django uses the output of __unicode__() in several places when it

needs to display objects.

Finally, note that __unicode__() is a good example of adding behavior to models. A Django model describes

more than the database table layout for an object; it also describes any functionality that object knows how to do.

__unicode__() is one example of such functionality – a model knows how to display itself.

7.10 Inserting and Updating Data

You’ve already seen this done: to insert a row into your database, ﬁrst create an instance of your model using keyword

arguments, like so:

>>> p=Publisher(name=’Apress’,

... address=’2855 Telegraph Ave.’,

... city=’Berkeley’,

... state_province=’CA’,

... country=’U.S.A.’,

... website=’http://www.apress.com/’)

As we noted above, this act of instantiating a model class does not touch the database. The record isn’t saved into the

database until you call save(), like this:

>>> p.save()

In SQL, this can roughly be translated into the following:

7.10. Inserting and Updating Data 73

The Django Book, Release 2.0

INSERT INTO books_publisher

(name, address, city, state_province, country, website)

VALUES

(’Apress’, ’2855 Telegraph Ave.’, ’Berkeley’, ’CA’,

’U.S.A.’, ’http://www.apress.com/’);

Because the Publisher model uses an autoincrementing primary key id, the initial call to save() does one more

thing: it calculates the primary key value for the record and sets it to the id attribute on the instance:

>>> p.id

52 # this will differ based on your own data

Subsequent calls to save() will save the record in place, without creating a new record (i.e., performing an SQL

UPDATE statement instead of an INSERT):

>>> p.name =’Apress Publishing’

>>> p.save()

The preceding save() statement will result in roughly the following SQL:

UPDATE books_publisher SET

name = ’Apress Publishing’,

address = ’2855 Telegraph Ave.’,

city = ’Berkeley’,

state_province = ’CA’,

country = ’U.S.A.’,

website = ’http://www.apress.com’

WHERE id = 52;

Yes, note that all of the ﬁelds will be updated, not just the ones that have been changed. Depending on your application,

this may cause a race condition. See “Updating Multiple Objects in One Statement” below to ﬁnd out how to execute

this (slightly different) query:

UPDATE books_publisher SET

name = ’Apress Publishing’

WHERE id=52;

7.11 Selecting Objects

Knowing how to create and update database records is essential, but chances are that the Web applications you’ll build

will be doing more querying of existing objects than creating new ones. We’ve already seen a way to retrieve every

record for a given model:

>>> Publisher.objects.all()

[<Publisher: Apress>, <Publisher: O’Reilly>]

This roughly translates to this SQL:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher;

Note: Notice that Django doesn’t use SELECT *when looking up data and instead lists all ﬁelds explicitly. This

is by design: in certain circumstances SELECT *can be slower, and (more important) listing ﬁelds more closely

follows one tenet of the Zen of Python: “Explicit is better than implicit.”

For more on the Zen of Python, try typing import this at a Python prompt.

74 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

Let’s take a close look at each part of this Publisher.objects.all() line:

• First, we have the model we deﬁned, Publisher. No surprise here: when you want to look up data, you use

the model for that data.

• Next, we have the objects attribute. This is called a manager. Managers are discussed in detail in Chapter

10. For now, all you need to know is that managers take care of all “table-level” operations on data including,

most important, data lookup.

All models automatically get a objects manager; you’ll use it any time you want to look up model instances.

• Finally, we have all(). This is a method on the objects manager that returns all the rows in the database.

Though this object looks like a list, it’s actually a QuerySet – an object that represents a speciﬁc set of rows from

the database. Appendix C deals with QuerySets in detail. For the rest of this chapter, we’ll just treat them like

the lists they emulate.

Any database lookup is going to follow this general pattern – we’ll call methods on the manager attached to the model

we want to query against.

7.11.1 Filtering Data

Naturally, it’s rare to want to select everything from a database at once; in most cases, you’ll want to deal with a subset

of your data. In the Django API, you can ﬁlter your data using the filter() method:

>>> Publisher.objects.filter(name=’Apress’)

[<Publisher: Apress>]

filter() takes keyword arguments that get translated into the appropriate SQL WHERE clauses. The preceding

example would get translated into something like this:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

WHERE name = ’Apress’;

You can pass multiple arguments into filter() to narrow down things further:

>>> Publisher.objects.filter(country="U.S.A.", state_province="CA")

[<Publisher: Apress>]

Those multiple arguments get translated into SQL AND clauses. Thus, the example in the code snippet translates into

the following:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

WHERE country = ’U.S.A.’

AND state_province = ’CA’;

Notice that by default the lookups use the SQL =operator to do exact match lookups. Other lookup types are available:

>>> Publisher.objects.filter(name__contains="press")

[<Publisher: Apress>]

That’s a double underscore there between name and contains. Like Python itself, Django uses the double under-

score to signal that something “magic” is happening – here, the __contains part gets translated by Django into a

SQL LIKE statement:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

WHERE name LIKE ’%press%’;

7.11. Selecting Objects 75

The Django Book, Release 2.0

Many other types of lookups are available, including icontains (case-insensitive LIKE), startswith and

endswith, and range (SQL BETWEEN queries). Appendix C describes all of these lookup types in detail.

7.11.2 Retrieving Single Objects

The filter() examples above all returned a QuerySet, which you can treat like a list. Sometimes it’s more

convenient to fetch only a single object, as opposed to a list. That’s what the get() method is for:

>>> Publisher.objects.get(name="Apress")

<Publisher: Apress>

Instead of a list (rather, QuerySet), only a single object is returned. Because of that, a query resulting in multiple

objects will cause an exception:

>>> Publisher.objects.get(country="U.S.A.")

Traceback (most recent call last):

...

MultipleObjectsReturned: get() returned more than one Publisher --

it returned 2! Lookup parameters were {’country’: ’U.S.A.’}

A query that returns no objects also causes an exception:

>>> Publisher.objects.get(name="Penguin")

Traceback (most recent call last):

...

DoesNotExist: Publisher matching query does not exist.

The DoesNotExist exception is an attribute of the model’s class – Publisher.DoesNotExist. In your appli-

cations, you’ll want to trap these exceptions, like this:

try:

p=Publisher.objects.get(name=’Apress’)

except Publisher.DoesNotExist:

print "Apress isn’t in the database yet."

else:

print "Apress is in the database."

7.11.3 Ordering Data

As you play around with the previous examples, you might discover that the objects are being returned in a seemingly

random order. You aren’t imagining things; so far we haven’t told the database how to order its results, so we’re simply

getting back data in some arbitrary order chosen by the database.

In your Django applications, you’ll probably want to order your results according to a certain value – say, alphabeti-

cally. To do this, use the order_by() method:

>>> Publisher.objects.order_by("name")

[<Publisher: Apress>, <Publisher: O’Reilly>]

This doesn’t look much different from the earlier all() example, but the SQL now includes a speciﬁc ordering:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

ORDER BY name;

You can order by any ﬁeld you like:

76 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

>>> Publisher.objects.order_by("address")

[<Publisher: O’Reilly>, <Publisher: Apress>]

>>> Publisher.objects.order_by("state_province")

[<Publisher: Apress>, <Publisher: O’Reilly>]

To order by multiple ﬁelds (where the second ﬁeld is used to disambiguate ordering in cases where the ﬁrst is the

same), use multiple arguments:

>>> Publisher.objects.order_by("state_province","address")

[<Publisher: Apress>, <Publisher: O’Reilly>]

You can also specify reverse ordering by preﬁxing the ﬁeld name with a -(that’s a minus character):

>>> Publisher.objects.order_by("-name")

[<Publisher: O’Reilly>, <Publisher: Apress>]

While this ﬂexibility is useful, using order_by() all the time can be quite repetitive. Most of the time you’ll have

a particular ﬁeld you usually want to order by. In these cases, Django lets you specify a default ordering in the model:

class Publisher(models.Model):

name = models.CharField(max_length=30)

address = models.CharField(max_length=50)

city = models.CharField(max_length=60)

state_province = models.CharField(max_length=30)

country = models.CharField(max_length=50)

website = models.URLField()

def __unicode__(self):

return self.name

class Meta:

ordering = [’name’]

Here, we’ve introduced a new concept: the class Meta, which is a class that’s embedded within the Publisher

class deﬁnition (i.e., it’s indented to be within class Publisher). You can use this Meta class on any model to

specify various model-speciﬁc options. A full reference of Meta options is available in Appendix B, but for now, we’re

concerned with the ordering option. If you specify this, it tells Django that unless an ordering is given explicitly

with order_by(), all Publisher objects should be ordered by the name ﬁeld whenever they’re retrieved with

the Django database API.

7.11.4 Chaining Lookups

You’ve seen how you can ﬁlter data, and you’ve seen how you can order it. Often, of course, you’ll need to do both.

In these cases, you simply “chain” the lookups together:

>>> Publisher.objects.filter(country="U.S.A.").order_by("-name")

[<Publisher: O’Reilly>, <Publisher: Apress>]

As you might expect, this translates to a SQL query with both a WHERE and an ORDER BY:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

WHERE country = ’U.S.A’

ORDER BY name DESC;

7.11. Selecting Objects 77

The Django Book, Release 2.0

7.11.5 Slicing Data

Another common need is to look up only a ﬁxed number of rows. Imagine you have thousands of publishers in your

database, but you want to display only the ﬁrst one. You can do this using Python’s standard list slicing syntax:

>>> Publisher.objects.order_by(’name’)[0]

<Publisher: Apress>

This translates roughly to:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

ORDER BY name

LIMIT 1;

Similarly, you can retrieve a speciﬁc subset of data using Python’s range-slicing syntax:

>>> Publisher.objects.order_by(’name’)[0:2]

This returns two objects, translating roughly to:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

ORDER BY name

OFFSET 0 LIMIT 2;

Note that negative slicing is not supported:

>>> Publisher.objects.order_by(’name’)[-1]

Traceback (most recent call last):

...

AssertionError: Negative indexing is not supported.

This is easy to get around, though. Just change the order_by() statement, like this:

>>> Publisher.objects.order_by(’-name’)[0]

7.11.6 Updating Multiple Objects in One Statement

We pointed out in the “Inserting and Updating Data” section that the model save() method updates all columns in

a row. Depending on your application, you may want to update only a subset of columns.

For example, let’s say we want to update the Apress Publisher to change the name from ’Apress’ to ’Apress

Publishing’. Using save(), it would look something like this:

>>> p=Publisher.objects.get(name=’Apress’)

>>> p.name =’Apress Publishing’

>>> p.save()

This roughly translates to the following SQL:

SELECT id, name, address, city, state_province, country, website

FROM books_publisher

WHERE name = ’Apress’;

UPDATE books_publisher SET

name = ’Apress Publishing’,

address = ’2855 Telegraph Ave.’,

city = ’Berkeley’,

78 Chapter 7. Chapter 5: Models

The Django Book, Release 2.0

state_province = ’CA’,

country = ’U.S.A.’,

website = ’http://www.apress.com’

WHERE id = 52;

(Note that this example assumes Apress has a publisher ID of 52.)

You can see in this example that Django’s save() method sets all of the column values, not just the name column.

If you’re in an environment where other columns of the database might change due to some other process, it’s smarter

to change only the column you need to change. To do this, use the update() method on QuerySet objects. Here’s

an example:

>>> Publisher.objects.filter(id=52).update(name=’Apress Publishing’)

The SQL translation here is much more efﬁcient and has no chance of race conditions:

UPDATE books_publisher

SET name = ’Apress Publishing’

WHERE id = 52;

The update() method works on any QuerySet, which means you can edit multiple records in bulk. Here’s how

you might change the country from ’U.S.A.’ to USA in each Publisher record:

>>> Publisher.objects.all().update(country=’USA’)

The update() method has a return value – an integer representing how many records changed. In the above example,

we got 2.

7.12 Deleting Objects

To delete an object from your database, simply call the object’s delete() method:

>>> p=Publisher.objects.get(name="O’Reilly")

>>> p.delete()

>>> Publisher.objects.all()

[<Publisher: Apress Publishing>]

You can also delete objects in bulk by calling delete() on the result of any QuerySet. This is similar to the

update() method we showed in the last section:

>>> Publisher.objects.filter(country=’USA’).delete()

>>> Publisher.objects.all().delete()

>>> Publisher.objects.all()

[]

Be careful deleting your data! As a precaution against deleting all of the data in a particular table, Django requires

you to explicitly use all() if you want to delete everything in your table. For example, this won’t work:

>>> Publisher.objects.delete()

Traceback (most recent call last):

File "<console>", line 1, in <module>

AttributeError: ’Manager’ object has no attribute ’delete’

But it’ll work if you add the all() method:

>>> Publisher.objects.all().delete()

7.12. Deleting Objects 79

The Django Book, Release 2.0

If you’re just deleting a subset of your data, you don’t need to include all(). To repeat a previous example:

>>> Publisher.objects.filter(country=’USA’).delete()

7.13 What’s Next?

Having read this chapter, you have enough knowledge of Django models to be able to write basic database applications.

Chapter 10 will provide some information on more advanced usage of Django’s database layer.

Once you’ve deﬁned your models, the next step is to populate your database with data. You might have legacy data,

in which case Chapter 18 will give you advice about integrating with legacy databases. You might rely on site users to

supply your data, in which case Chapter 7 will teach you how to process user-submitted form data.

But in some cases, you or your team might need to enter data manually, in which case it would be helpful to have a

Web-based interface for entering and managing data. The next chapter Chapter 6 covers Django’s admin interface,

which exists precisely for that reason.

80 Chapter 7. Chapter 5: Models

CHAPTER 8

Chapter 6: The Django Admin Site

For a certain class of Web sites, an admin interface is an essential part of the infrastructure. This is a Web-based

interface, limited to trusted site administrators, that enables the adding, editing and deletion of site content. Some

common examples: the interface you use to post to your blog, the backend site managers use to moderate user-

generated comments, the tool your clients use to update the press releases on the Web site you built for them.

There’s a problem with admin interfaces, though: it’s boring to build them. Web development is fun when you’re

developing public-facing functionality, but building admin interfaces is always the same. You have to authenticate

users, display and handle forms, validate input, and so on. It’s boring, and it’s repetitive.

So what’s Django’s approach to these boring, repetitive tasks? It does it all for you – in just a couple of lines of code,

no less. With Django, building an admin interface is a solved problem.

This chapter is about Django’s automatic admin interface. The feature works by reading metadata in your model

to provide a powerful and production-ready interface that site administrators can start using immediately. Here, we

discuss how to activate, use, and customize this feature.

Note that we recommend reading this chapter even if you don’t intend to use the Django admin site, because we

introduce a few concepts that apply to all of Django, regardless of admin-site usage.

8.1 The django.contrib packages

Django’s automatic admin is part of a larger suite of Django functionality called django.contrib – the part of the

Django codebase that contains various useful add-ons to the core framework. You can think of django.contrib as

Django’s equivalent of the Python standard library – optional, de facto implementations of common patterns. They’re

bundled with Django so that you don’t have to reinvent the wheel in your own applications.

The admin site is the ﬁrst part of django.contrib that we’re covering in this book; technically, it’s called

django.contrib.admin. Other available features in django.contrib include a user authentication sys-

tem (django.contrib.auth), support for anonymous sessions (django.contrib.sessions) and even a

system for user comments (django.contrib.comments). You’ll get to know the various django.contrib

features as you become a Django expert, and we’ll spend some more time discussing them in Chapter 16. For now,

just know that Django ships with many nice add-ons, and django.contrib is generally where they live.

8.2 Activating the Admin Interface

The Django admin site is entirely optional, because only certain types of sites need this functionality. That means

you’ll need to take a few steps to activate it in your project.

First, make a few changes to your settings ﬁle:

The Django Book, Release 2.0

1. Add ’django.contrib.admin’ to the INSTALLED_APPS setting. (The order of INSTALLED_APPS

doesn’t matter, but we like to keep things alphabetical so it’s easy for a human to read.)

2. Make sure INSTALLED_APPS contains ’django.contrib.auth’,

’django.contrib.contenttypes’,’django.contrib.messages’ and

’django.contrib.sessions’. The Django admin site requires these three packages. (If you’re fol-

lowing along with our ongoing mysite project, note that we commented out these four INSTALLED_APPS

entries in Chapter 5. Uncomment them now.)

3. Make sure MIDDLEWARE_CLASSES contains ’django.middleware.common.CommonMiddleware’,

’django.contrib.messages.middleware.MessageMiddleware’,

’django.contrib.sessions.middleware.SessionMiddleware’ and

’django.contrib.auth.middleware.AuthenticationMiddleware’. (Again, if you’re

following along, note that we commented them out in Chapter 5, so uncomment them.)

Second, run python manage.py syncdb. This step will install the extra database tables that the admin interface

uses. The ﬁrst time you run syncdb with ’django.contrib.auth’ in INSTALLED_APPS, you’ll be asked

about creating a superuser. If you don’t do this, you’ll need to run python manage.py createsuperuser

separately to create an admin user account; otherwise, you won’t be able to log in to the admin site. (Potential gotcha:

the python manage.py createsuperuser command is only available if ’django.contrib.auth’ is in

your INSTALLED_APPS.)

Third, add the admin site to your URLconf (in urls.py, remember). By default, the urls.py generated by

django-admin.py startproject contains commented-out code for the Django admin, and all you have to do

is uncomment it. For the record, here are the bits you need to make sure are in there:

# Include these import statements...

from django.contrib import admin

admin.autodiscover()

# And include this URLpattern...

urlpatterns =patterns(’’,

# ...

(r’^admin/’, include(admin.site.urls)),

# ...

)

With that bit of conﬁguration out of the way, now you can see the Django admin site in action.

Just run the development server (python manage.py runserver, as in previous chapters) and visit

http://127.0.0.1:8000/admin/ in your Web browser.

8.3 Using the Admin Site

The admin site is designed to be used by nontechnical users, and as such it should be pretty self-explanatory. Never-

theless, we’ll give you a quick walkthrough of the basic features.

The ﬁrst thing you’ll see is a login screen, as shown in Figure 6-1.

sure you’ve actually created a superuser – try running python manage.py createsuperuser.

Once you’re logged in, the ﬁrst thing you’ll see will be the admin home page. This page lists all the available types of

data that can be edited on the admin site. At this point, because we haven’t activated any of our own models yet, the

list is sparse: it includes only Groups and Users, which are the two default admin-editable models.

Each type of data in the Django admin site has a change list and an edit form. Change lists show you all the available

objects in the database, and edit forms let you add, change or delete particular records in your database.

82 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

Figure 8.1: Figure 6-1. Django’s login screen

8.3. Using the Admin Site 83

The Django Book, Release 2.0

Figure 8.2: Figure 6-2. The Django admin home page

84 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

Other languages

If your primary language is not English and your Web browser is conﬁgured to prefer a language other than English,

you can make a quick change to see whether the Django admin site has been translated into your language. Just add

’django.middleware.locale.LocaleMiddleware’ to your MIDDLEWARE_CLASSES setting, making

sure it appears after ’django.contrib.sessions.middleware.SessionMiddleware’.

When you’ve done that, reload the admin index page. If a translation for your language is available, then the various

parts of the interface – from the “Change password” and “Log out” links at the top of the page, to the “Groups” and

“Users” links in the middle – will appear in your language instead of English. Django ships with translations for

dozens of languages.

For much more on Django’s internationalization features, see Chapter 19.

Click the “Change” link in the “Users” row to load the change list page for users.

Figure 8.3: Figure 6-3. The user change list page

This page displays all users in the database; you can think of it as a prettied-up Web version of a SELECT *FROM

auth_user; SQL query. If you’re following along with our ongoing example, you’ll only see one user here,

assuming you’ve added only one, but once you have more users, you’ll probably ﬁnd the ﬁltering, sorting and searching

options useful. Filtering options are at right, sorting is available by clicking a column header, and the search box at

the top lets you search by username.

8.3. Using the Admin Site 85

The Django Book, Release 2.0

Click the username of the user you created, and you’ll see the edit form for that user.

Figure 8.4: Figure 6-4. The user edit form

This page lets you change the attributes of the user, like the ﬁrst/last names and various permissions. (Note that to

change a user’s password, you should click “change password form” under the password ﬁeld rather than editing the

hashed code.) Another thing to note here is that ﬁelds of different types get different widgets – for example, date/time

ﬁelds have calendar controls, boolean ﬁelds have checkboxes, character ﬁelds have simple text input ﬁelds.

You can delete a record by clicking the delete button at the bottom left of its edit form. That’ll take you to a conﬁr-

mation page, which, in some cases, will display any dependent objects that will be deleted, too. (For example, if you

delete a publisher, any book with that publisher will be deleted, too!)

You can add a record by clicking “Add” in the appropriate column of the admin home page. This will give you an

empty version of the edit page, ready for you to ﬁll out.

You’ll also notice that the admin interface also handles input validation for you. Try leaving a required ﬁeld blank or

putting an invalid date into a date ﬁeld, and you’ll see those errors when you try to save, as shown in Figure 6-5.

When you edit an existing object, you’ll notice a History link in the upper-right corner of the window. Every change

made through the admin interface is logged, and you can examine this log by clicking the History link (see Figure

6-6).

86 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

Figure 8.5: Figure 6-5. An edit form displaying errors

8.3. Using the Admin Site 87

The Django Book, Release 2.0

Figure 8.6: Figure 6-6. An object history page

88 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

8.4 Adding Your Models to the Admin Site

There’s one crucial part we haven’t done yet. Let’s add our own models to the admin site, so we can add, change

and delete objects in our custom database tables using this nice interface. We’ll continue the books example from

Chapter 5, where we deﬁned three models: Publisher,Author and Book.

Within the books directory (mysite/books), create a ﬁle called admin.py, and type in the following lines of

code:

from django.contrib import admin

from mysite.books.models import Publisher, Author, Book

admin.site.register(Publisher)

admin.site.register(Author)

admin.site.register(Book)

This code tells the Django admin site to offer an interface for each of these models.

Once you’ve done this, go to your admin home page in your Web browser (http://127.0.0.1:8000/admin/),

and you should see a “Books” section with links for Authors, Books and Publishers. (You might have to stop and start

the runserver for the changes to take effect.)

You now have a fully functional admin interface for each of those three models. That was easy!

Take some time to add and change records, to populate your database with some data. If you followed Chapter

5’s examples of creating Publisher objects (and you didn’t delete them), you’ll already see those records on the

publisher change list page.

One feature worth mentioning here is the admin site’s handling of foreign keys and many-to-many relationships, both

of which appear in the Book model. As a reminder, here’s what the Book model looks like:

class Book(models.Model):

title =models.CharField(max_length=100)

authors =models.ManyToManyField(Author)

publisher =models.ForeignKey(Publisher)

publication_date =models.DateField()

def __unicode__(self):

return self.title

On the Django admin site’s “Add book” page (http://127.0.0.1:8000/admin/books/book/add/), the

publisher (a ForeignKey) is represented by a select box, and the authors ﬁeld (a ManyToManyField) is repre-

sented by a multiple-select box. Both ﬁelds sit next to a green plus sign icon that lets you add related records of that

type. For example, if you click the green plus sign next to the “Publisher” ﬁeld, you’ll get a pop-up window that lets

you add a publisher. After you successfully create the publisher in the pop-up, the “Add book” form will be updated

with the newly created publisher. Slick.

8.5 How the Admin Site Works

Behind the scenes, how does the admin site work? It’s pretty straightforward.

When Django loads your URLconf from urls.py at server startup, it executes the admin.autodiscover()

statement that we added as part of activating the admin. This function iterates over your INSTALLED_APPS setting

and looks for a ﬁle called admin.py in each installed app. If an admin.py exists in a given app, it executes the

code in that ﬁle.

8.4. Adding Your Models to the Admin Site 89

The Django Book, Release 2.0

In the admin.py in our books app, each call to admin.site.register() simply registers the given model

with the admin. The admin site will only display an edit/change interface for models that have been explicitly regis-

tered.

The app django.contrib.auth includes its own admin.py, which is why Users and Groups showed up au-

tomatically in the admin. Other django.contrib apps, such as django.contrib.redirects, also add

themselves to the admin, as do many third-party Django applications you might download from the Web.

Beyond that, the Django admin site is just a Django application, with its own models, templates, views and URLpat-

terns. You add it to your application by hooking it into your URLconf, just as you hook in your own views. You can

inspect its templates, views and URLpatterns by poking around in django/contrib/admin in your copy of the

Django codebase – but don’t be tempted to change anything directly in there, as there are plenty of hooks for you to

customize the way the admin site works. (If you do decide to poke around the Django admin application, keep in mind

it does some rather complicated things in reading metadata about models, so it would probably take a good amount of

time to read and understand the code.)

8.6 Making Fields Optional

After you play around with the admin site for a while, you’ll probably notice a limitation – the edit forms require every

ﬁeld to be ﬁlled out, whereas in many cases you’d want certain ﬁelds to be optional. Let’s say, for example, that we

want our Author model’s email ﬁeld to be optional – that is, a blank string should be allowed. In the real world,

you might not have an e-mail address on ﬁle for every author.

To specify that the email ﬁeld is optional, edit the Author model (which, as you’ll recall from Chapter 5, lives in

mysite/books/models.py). Simply add blank=True to the email ﬁeld, like so:

class Author(models.Model):

first_name = models.CharField(max_length=30)

last_name = models.CharField(max_length=40)

email = models.EmailField(blank=True)

This tells Django that a blank value is indeed allowed for authors’ e-mail addresses. By default, all ﬁelds have

blank=False, which means blank values are not allowed.

There’s something interesting happening here. Until now, with the exception of the __unicode__() method, our

models have served as deﬁnitions of our database tables – Pythonic expressions of SQL CREATE TABLE statements,

essentially. In adding blank=True, we have begun expanding our model beyond a simple deﬁnition of what the

database table looks like. Now, our model class is starting to become a richer collection of knowledge about what

Author objects are and what they can do. Not only is the email ﬁeld represented by a VARCHAR column in the

database; it’s also an optional ﬁeld in contexts such as the Django admin site.

Once you’ve added that blank=True, reload the “Add author” edit form

(http://127.0.0.1:8000/admin/books/author/add/), and you’ll notice the ﬁeld’s label – “Email”

– is no longer bolded. This signiﬁes it’s not a required ﬁeld. You can now add authors without needing to provide

e-mail addresses; you won’t get the loud red “This ﬁeld is required” message anymore, if the ﬁeld is submitted empty.

8.6.1 Making Date and Numeric Fields Optional

A common gotcha related to blank=True has to do with date and numeric ﬁelds, but it requires a fair amount of

background explanation.

SQL has its own way of specifying blank values – a special value called NULL.NULL could mean “unknown,” or

“invalid,” or some other application-speciﬁc meaning.

90 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

In SQL, a value of NULL is different than an empty string, just as the special Python object None is different than

an empty Python string (""). This means it’s possible for a particular character ﬁeld (e.g., a VARCHAR column) to

contain both NULL values and empty string values.

This can cause unwanted ambiguity and confusion: “Why does this record have a NULL but this other one has an

empty string? Is there a difference, or was the data just entered inconsistently?” And: “How do I get all the records

that have a blank value – should I look for both NULL records and empty strings, or do I only select the ones with

empty strings?”

To help avoid such ambiguity, Django’s automatically generated CREATE TABLE statements (which were covered in

Chapter 5) add an explicit NOT NULL to each column deﬁnition. For example, here’s the generated statement for our

Author model, from Chapter 5:

CREATE TABLE "books_author" (

"id" serial NOT NULL PRIMARY KEY,

"first_name" varchar(30) NOT NULL,

"last_name" varchar(40) NOT NULL,

"email" varchar(75) NOT NULL

)

;

In most cases, this default behavior is optimal for your application and will save you from data-inconsistency

headaches. And it works nicely with the rest of Django, such as the Django admin site, which inserts an empty

string (not aNULL value) when you leave a character ﬁeld blank.

But there’s an exception with database column types that do not accept empty strings as valid values – such as dates,

times and numbers. If you try to insert an empty string into a date or integer column, you’ll likely get a database error,

depending on which database you’re using. (PostgreSQL, which is strict, will raise an exception here; MySQL might

accept it or might not, depending on the version you’re using, the time of day and the phase of the moon.) In this case,

NULL is the only way to specify an empty value. In Django models, you can specify that NULL is allowed by adding

null=True to a ﬁeld.

So that’s a long way of saying this: if you want to allow blank values in a date ﬁeld (e.g., DateField,TimeField,

DateTimeField) or numeric ﬁeld (e.g., IntegerField,DecimalField,FloatField), you’ll need to use

both null=True and blank=True.

For sake of example, let’s change our Book model to allow a blank publication_date. Here’s the revised code:

class Book(models.Model):

title = models.CharField(max_length=100)

authors = models.ManyToManyField(Author)

publisher = models.ForeignKey(Publisher)

publication_date = models.DateField(blank=True, null=True)

Adding null=True is more complicated than adding blank=True, because null=True changes the seman-

tics of the database – that is, it changes the CREATE TABLE statement to remove the NOT NULL from the

publication_date ﬁeld. To complete this change, we’ll need to update the database.

For a number of reasons, Django does not attempt to automate changes to database schemas, so it’s your own respon-

sibility to execute the appropriate ALTER TABLE statement whenever you make such a change to a model. Recall

that you can use manage.py dbshell to enter your database server’s shell. Here’s how to remove the NOT NULL

in this particular case:

ALTER TABLE books_book ALTER COLUMN publication_date DROP NOT NULL;

(Note that this SQL syntax is speciﬁc to PostgreSQL.)

We’ll cover schema changes in more depth in Chapter 10.

Bringing this back to the admin site, now the “Add book” edit form should allow for empty publication date values.

8.6. Making Fields Optional 91

The Django Book, Release 2.0

8.7 Customizing Field Labels

On the admin site’s edit forms, each ﬁeld’s label is generated from its model ﬁeld name. The algorithm is simple:

Django just replaces underscores with spaces and capitalizes the ﬁrst character, so, for example, the Book model’s

publication_date ﬁeld has the label “Publication date.”

However, ﬁeld names don’t always lend themselves to nice admin ﬁeld labels, so in some cases you might want to

customize a label. You can do this by specifying verbose_name in the appropriate model ﬁeld.

For example, here’s how we can change the label of the Author.email ﬁeld to “e-mail,” with a hyphen:

class Author(models.Model):

first_name = models.CharField(max_length=30)

last_name = models.CharField(max_length=40)

email = models.EmailField(blank=True, verbose_name=’e-mail’)

Make that change and reload the server, and you should see the ﬁeld’s new label on the author edit form.

Note that you shouldn’t capitalize the ﬁrst letter of a verbose_name unless it should always be capitalized (e.g.,

"USA state"). Django will automatically capitalize it when it needs to, and it will use the exact verbose_name

value in other places that don’t require capitalization.

Finally, note that you can pass the verbose_name as a positional argument, for a slightly more compact syntax.

This example is equivalent to the previous one:

class Author(models.Model):

first_name = models.CharField(max_length=30)

last_name = models.CharField(max_length=40)

email = models.EmailField(‘e-mail’, blank=True)

This won’t work with ManyToManyField or ForeignKey ﬁelds, though, because they require the ﬁrst argument

to be a model class. In those cases, specifying verbose_name explicitly is the way to go.

8.8 Custom ModelAdmin classes

The changes we’ve made so far – blank=True,null=True and verbose_name – are really model-level

changes, not admin-level changes. That is, these changes are fundamentally a part of the model and just so happen to

be used by the admin site; there’s nothing admin-speciﬁc about them.

Beyond these, the Django admin site offers a wealth of options that let you customize how the admin site works for a

particular model. Such options live in ModelAdmin classes, which are classes that contain conﬁguration for a speciﬁc

model in a speciﬁc admin site instance.

8.8.1 Customizing change lists

Let’s dive into admin customization by specifying the ﬁelds that are displayed on the change list for our Author

model. By default, the change list displays the result of __unicode__() for each object. In Chapter 5, we deﬁned

the __unicode__() method for Author objects to display the ﬁrst name and last name together:

class Author(models.Model):

first_name = models.CharField(max_length=30)

last_name = models.CharField(max_length=40)

email = models.EmailField(blank=True, verbose_name=’e-mail’)

def __unicode__(self):

return u’%s %s’ % (self.first_name, self.last_name)

92 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

As a result, the change list for Author objects displays each other’s ﬁrst name and last name together, as you can see

in Figure 6-7.

Figure 8.7: Figure 6-7. The author change list page

We can improve on this default behavior by adding a few other ﬁelds to the change list display. It’d be handy, for

example, to see each author’s e-mail address in this list, and it’d be nice to be able to sort by ﬁrst and last name.

To make this happen, we’ll deﬁne a ModelAdmin class for the Author model. This class is the key to customizing

the admin, and one of the most basic things it lets you do is specify the list of ﬁelds to display on change list pages.

Edit admin.py to make these changes:

from django.contrib import admin

from mysite.books.models import Publisher, Author, Book

class AuthorAdmin(admin.ModelAdmin):

list_display = (‘first_name’, ‘last_name’, ‘email’)

admin.site.register(Publisher)

admin.site.register(Author, AuthorAdmin)

admin.site.register(Book)

Here’s what we’ve done:

8.8. Custom ModelAdmin classes 93

The Django Book, Release 2.0

• We created the class AuthorAdmin. This class, which subclasses

django.contrib.admin.ModelAdmin, holds custom conﬁguration for a speciﬁc admin model.

We’ve only speciﬁed one customization – list_display, which is set to a tuple of ﬁeld names to display on

the change list page. These ﬁeld names must exist in the model, of course.

• We altered the admin.site.register() call to add AuthorAdmin after Author. You can read this as:

“Register the Author model with the AuthorAdmin options.”

The admin.site.register() function takes a ModelAdmin subclass as an optional second argument.

If you don’t specify a second argument (as is the case for Publisher and Book), Django will use the default

admin options for that model.

With that tweak made, reload the author change list page, and you’ll see it’s now displaying three columns – the ﬁrst

name, last name and e-mail address. In addition, each of those columns is sortable by clicking on the column header.

(See Figure 6-8.)

Figure 8.8: Figure 6-8. The author change list page after list_display

Next, let’s add a simple search bar. Add search_fields to the AuthorAdmin, like so:

class AuthorAdmin(admin.ModelAdmin):

list_display = (‘first_name’, ‘last_name’, ‘email’)

search_fields = (‘first_name’, ‘last_name’)

94 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

Reload the page in your browser, and you should see a search bar at the top. (See Figure 6-9.) We’ve just told the

admin change list page to include a search bar that searches against the first_name and last_name ﬁelds. As a

user might expect, this is case-insensitive and searches both ﬁelds, so searching for the string "bar" would ﬁnd both

an author with the ﬁrst name Barney and an author with the last name Hobarson.

Figure 8.9: Figure 6-9. The author change list page after search_ﬁelds

Next, let’s add some date ﬁlters to our Book model’s change list page:

from django.contrib import admin

from mysite.books.models import Publisher, Author, Book

class AuthorAdmin(admin.ModelAdmin):

list_display = (‘first_name’, ‘last_name’, ‘email’)

search_fields = (‘first_name’, ‘last_name’)

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

admin.site.register(Publisher)

admin.site.register(Author, AuthorAdmin)

admin.site.register(Book, BookAdmin)

8.8. Custom ModelAdmin classes 95

The Django Book, Release 2.0

Here, because we’re dealing with a different set of options, we created a separate ModelAdmin class – BookAdmin.

First, we deﬁned a list_display just to make the change list look a bit nicer. Then, we used list_filter,

which is set to a tuple of ﬁelds to use to create ﬁlters along the right side of the change list page. For date ﬁelds,

Django provides shortcuts to ﬁlter the list to “Today,” “Past 7 days,” “This month” and “This year” – shortcuts that

Django’s developers have found hit the common cases for ﬁltering by date. Figure 6-10 shows what that looks like.

Figure 8.10: Figure 6-10. The book change list page after list_ﬁlter

list_filter also works on ﬁelds of other types, not just DateField. (Try it with BooleanField and

ForeignKey ﬁelds, for example.) The ﬁlters show up as long as there are at least 2 values to choose from.

Another way to offer date ﬁlters is to use the date_hierarchy admin option, like this:

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

With this in place, the change list page gets a date drill-down navigation bar at the top of the list, as shown in Figure

6-11. It starts with a list of available years, then drills down into months and individual days.

Note that date_hierarchy takes a string, not a tuple, because only one date ﬁeld can be used to make the hierarchy.

Finally, let’s change the default ordering so that books on the change list page are always ordered descending by their

publication date. By default, the change list orders objects according to their model’s ordering within class

96 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

Figure 8.11: Figure 6-11. The book change list page after date_hierarchy

8.8. Custom ModelAdmin classes 97

The Django Book, Release 2.0

Meta (which we covered in Chapter 5) – but you haven’t speciﬁed this ordering value, then the ordering is unde-

ﬁned.

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

ordering = (‘-publication_date’,)

This admin ordering option works exactly as the ordering in models’ class Meta, except that it only uses

the ﬁrst ﬁeld name in the list. Just pass a list or tuple of ﬁeld names, and add a minus sign to a ﬁeld to use descending

sort order.

Reload the book change list to see this in action. Note that the “Publication date” header now includes a small arrow

that indicates which way the records are sorted. (See Figure 6-12.)

Figure 8.12: Figure 6-12. The book change list page after ordering

We’ve covered the main change list options here. Using these options, you can make a very powerful, production-ready

data-editing interface with only a few lines of code.

98 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

8.8.2 Customizing edit forms

Just as the change list can be customized, edit forms can be customized in many ways.

First, let’s customize the way ﬁelds are ordered. By default, the order of ﬁelds in an edit form corresponds to the order

they’re deﬁned in the model. We can change that using the fields option in our ModelAdmin subclass:

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

ordering = (‘-publication_date’,)

fields = (‘title’, ‘authors’, ‘publisher’, ‘publication_date’)

After this change, the edit form for books will use the given ordering for ﬁelds. It’s slightly more natural to have

the authors after the book title. Of course, the ﬁeld order should depend on your data-entry workﬂow. Every form is

different.

Another useful thing the fields option lets you do is to exclude certain ﬁelds from being edited entirely. Just leave

out the ﬁeld(s) you want to exclude. You might use this if your admin users are only trusted to edit a certain segment

of your data, or if part of your ﬁelds are changed by some outside, automated process. For example, in our book

database, we could hide the publication_date ﬁeld from being editable:

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

ordering = (‘-publication_date’,)

fields = (‘title’, ‘authors’, ‘publisher’)

As a result, the edit form for books doesn’t offer a way to specify the publication date. This could be useful, say, if

you’re an editor who prefers that his authors not push back publication dates. (This is purely a hypothetical example,

of course.)

When a user uses this incomplete form to add a new book, Django will simply set the publication_date to None

– so make sure that ﬁeld has null=True.

Another commonly used edit-form customization has to do with many-to-many ﬁelds. As we’ve seen on the edit form

for books, the admin site represents each ManyToManyField as a multiple-select boxes, which is the most logical

HTML input widget to use – but multiple-select boxes can be difﬁcult to use. If you want to select multiple items, you

have to hold down the control key, or command on a Mac, to do so. The admin site helpfully inserts a bit of text that

explains this, but, still, it gets unwieldy when your ﬁeld contains hundreds of options.

The admin site’s solution is filter_horizontal. Let’s add that to BookAdmin and see what it does.

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

ordering = (‘-publication_date’,)

filter_horizontal = (‘authors’,)

(If you’re following along, note that we’ve also removed the fields option to restore all the ﬁelds in the edit form.)

Reload the edit form for books, and you’ll see that the “Authors” section now uses a fancy JavaScript ﬁlter interface that

lets you search through the options dynamically and move speciﬁc authors from “Available authors” to the “Chosen

authors” box, and vice versa.

We’d highly recommend using filter_horizontal for any ManyToManyField that has more than 10 items.

It’s far easier to use than a simple multiple-select widget. Also, note you can use filter_horizontal for multiple

8.8. Custom ModelAdmin classes 99

The Django Book, Release 2.0

Figure 8.13: Figure 6-13. The book edit form after adding ﬁlter_horizontal

100 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

ﬁelds – just specify each name in the tuple.

ModelAdmin classes also support a filter_vertical option. This works exactly as filter_horizontal,

but the resulting JavaScript interface stacks the two boxes vertically instead of horizontally. It’s a matter of personal

taste.

filter_horizontal and filter_vertical only work on ManyToManyField ﬁelds, not ForeignKey

ﬁelds. By default, the admin site uses simple <select> boxes for ForeignKey ﬁelds, but, as for

ManyToManyField, sometimes you don’t want to incur the overhead of having to select all the related objects

to display in the drop-down. For example, if our book database grows to include thousands of publishers, the “Add

book” form could take a while to load, because it would have to load every publisher for display in the <select>

box.

The way to ﬁx this is to use an option called raw_id_fields. Set this to a tuple of ForeignKey ﬁeld names,

and those ﬁelds will be displayed in the admin with a simple text input box (<input type="text">) instead of a

<select>. See Figure 6-14.

class BookAdmin(admin.ModelAdmin):

list_display = (‘title’, ‘publisher’, ‘publication_date’)

list_filter = (‘publication_date’,)

date_hierarchy = ‘publication_date’

ordering = (‘-publication_date’,)

filter_horizontal = (‘authors’,)

raw_id_fields = (‘publisher’,)

What do you enter in this input box? The database ID of the publisher. Given that humans don’t normally memorize

database IDs, there’s also a magnifying-glass icon that you can click to pull up a pop-up window, from which you can

select the publisher to add.

8.9 Users, Groups, and Permissions

Because you’re logged in as a superuser, you have access to create, edit, and delete any object. Naturally, different

environments require different permission systems – not everybody can or should be a superuser. Django’s admin site

uses a permissions system that you can use to give speciﬁc users access only to the portions of the interface that they

need.

These user accounts are meant to be generic enough to be used outside of the admin interface, but we’ll just treat them

as admin user accounts for now. In Chapter 14, we’ll cover how to integrate user accounts with the rest of your site

(i.e., not just the admin site).

You can edit users and permissions through the admin interface just like any other object. We saw this earlier in

this chapter, when we played around with the User and Group sections of the admin. User objects have the standard

username, password, e-mail and real name ﬁelds you might expect, along with a set of ﬁelds that deﬁne what the user

is allowed to do in the admin interface. First, there’s a set of three boolean ﬂags:

• The “active” ﬂag controls whether the user is active at all. If this ﬂag is off and the user tries to log in, he won’t

be allowed in, even with a valid password.

• The “staff” ﬂag controls whether the user is allowed to log in to the admin interface (i.e., whether that user is

considered a “staff member” in your organization). Since this same user system can be used to control access to

public (i.e., non-admin) sites (see Chapter 14), this ﬂag differentiates between public users and administrators.

• The “superuser” ﬂag gives the user full access to add, create and delete any item in the admin interface. If a user

has this ﬂag set, then all regular permissions (or lack thereof) are ignored for that user.

“Normal” admin users – that is, active, non-superuser staff members – are granted admin access through assigned

permissions. Each object editable through the admin interface (e.g., books, authors, publishers) has three permissions:

8.9. Users, Groups, and Permissions 101

The Django Book, Release 2.0

Figure 8.14: Figure 6-14. The book edit form after adding raw_id_ﬁelds

102 Chapter 8. Chapter 6: The Django Admin Site

The Django Book, Release 2.0

acreate permission, an edit permission and a delete permission. Assigning permissions to a user grants the user access

to do what is described by those permissions.

When you create a user, that user has no permissions, and it’s up to you to give the user speciﬁc permissions. For

example, you can give a user permission to add and change publishers, but not permission to delete them. Note

that these permissions are deﬁned per-model, not per-object – so they let you say “John can make changes to any

book,” but they don’t let you say “John can make changes to any book published by Apress.” The latter functionality,

per-object permissions, is a bit more complicated and is outside the scope of this book but is covered in the Django

documentation.

Note: Access to edit users and permissions is also controlled by this permission system. If you give someone

permission to edit users, she will be able to edit her own permissions, which might not be what you want! Giving a

user permission to edit users is essentially turning a user into a superuser.

You can also assign users to groups. A group is simply a set of permissions to apply to all members of that group.

Groups are useful for granting identical permissions to a subset of users.

8.10 When and Why to Use the Admin Interface – And When Not to

After having worked through this chapter, you should have a good idea of how to use Django’s admin site. But we

want to make a point of covering when and why you might want to use it – and when not to use it.

Django’s admin site especially shines when nontechnical users need to be able to enter data; that’s the purpose behind

the feature, after all. At the newspaper where Django was ﬁrst developed, development of a typical online feature –

say, a special report on water quality in the municipal supply – would go something like this:

• The reporter responsible for the project meets with one of the developers and describes the available data.

• The developer designs Django models to ﬁt this data and then opens up the admin site to the reporter.

• The reporter inspects the admin site to point out any missing or extraneous ﬁelds – better now than later. The

developer changes the models iteratively.

• When the models are agreed upon, the reporter begins entering data using the admin site. At the same time, the

programmer can focus on developing the publicly accessible views/templates (the fun part!).

In other words, the raison d’être of Django’s admin interface is facilitating the simultaneous work of content producers

and programmers.

However, beyond these obvious data entry tasks, the admin site is useful in a few other cases:

•Inspecting data models: Once you’ve deﬁned a few models, it can be quite useful to call them up in the ad-

min interface and enter some dummy data. In some cases, this might reveal data-modeling mistakes or other

problems with your models.

•Managing acquired data: For applications that rely on data coming from external sources (e.g., users or Web

crawlers), the admin site gives you an easy way to inspect or edit this data. You can think of it as a less powerful,

but more convenient, version of your database’s command-line utility.

•Quick and dirty data-management apps: You can use the admin site to build yourself a very lightweight data

management app – say, to keep track of expenses. If you’re just building something for your own needs, not for

public consumption, the admin site can take you a long way. In this sense, you can think of it as a beefed up,

relational version of a spreadsheet.

One ﬁnal point we want to make clear is: the admin site is not an end-all-be-all. Over the years, we’ve seen it hacked

and chopped up to serve a variety of functions it wasn’t intended to serve. It’s not intended to be a public interface to

data, nor is it intended to allow for sophisticated sorting and searching of your data. As we said early in this chapter,

it’s for trusted site administrators. Keeping this sweet spot in mind is the key to effective admin-site usage.

8.10. When and Why to Use the Admin Interface – And When Not to 103

The Django Book, Release 2.0

8.11 What’s Next?

So far we’ve created a few models and conﬁgured a top-notch interface for editing data. In the next chapter Chapter 7,

we’ll move on to the real “meat and potatoes” of Web development: form creation and processing.

104 Chapter 8. Chapter 6: The Django Admin Site

CHAPTER 9

Chapter 7: Forms

HTML forms are the backbone of interactive Web sites, from the simplicity of Google’s single search box to ubiq-

uitous blog comment-submission forms to complex custom data-entry interfaces. This chapter covers how you can

use Django to access user-submitted form data, validate it and do something with it. Along the way, we’ll cover

HttpRequest and Form objects.

9.1 Getting Data From the Request Object

We introduced HttpRequest objects in Chapter 3 when we ﬁrst covered view functions, but we didn’t have much

to say about them at the time. Recall that each view function takes an HttpRequest object as its ﬁrst parameter, as

in our hello() view:

from django.http import HttpResponse

def hello(request):

return HttpResponse("Hello world")

HttpRequest objects, such as the variable request here, have a number of interesting attributes and methods that

you should familiarize yourself with, so that you know what’s possible. You can use these attributes to get information

about the current request (i.e., the user/Web browser that’s loading the current page on your Django-powered site), at

the time the view function is executed.

9.1.1 Information About the URL

HttpRequest objects contain several pieces of information about the currently requested URL:

Attribute/method Description Example

request.path The full path, not including the domain but

including the leading slash.

"/hello/"

request.get_host()The host (i.e., the “domain,” in common

parlance).

"127.0.0.1:8000" or

"www.example.com"

request.get_full_path()The path, plus a query string (if available). "/hello/?print=true"

request.is_secure()True if the request was made via HTTPS.

Otherwise, False.

True or False

Always use these attributes/methods instead of hard-coding URLs in your views. This makes for more ﬂexible code

that can be reused in other places. A simplistic example:

105

The Django Book, Release 2.0

# BAD!

def current_url_view_bad(request):

return HttpResponse("Welcome to the page at /current/")

# GOOD

def current_url_view_good(request):

return HttpResponse("Welcome to the page at %s"%request.path)

9.1.2 Other Information About the Request

request.META is a Python dictionary containing all available HTTP headers for the given request – including the

user’s IP address and user agent (generally the name and version of the Web browser). Note that the full list of available

headers depends on which headers the user sent and which headers your Web server sets. Some commonly available

keys in this dictionary are:

•HTTP_REFERER – The referring URL, if any. (Note the misspelling of REFERER.)

•HTTP_USER_AGENT – The user’s browser’s user-agent string, if any. This looks something like:

"Mozilla/5.0 (X11; U; Linux i686; fr-FR; rv:1.8.1.17) Gecko/20080829

Firefox/2.0.0.17".

•REMOTE_ADDR – The IP address of the client, e.g., "12.345.67.89". (If the request has

passed through any proxies, then this might be a comma-separated list of IP addresses, e.g.,

"12.345.67.89,23.456.78.90".)

Note that because request.META is just a basic Python dictionary, you’ll get a KeyError exception if you try to

access a key that doesn’t exist. (Because HTTP headers are external data – that is, they’re submitted by your users’

browsers – they shouldn’t be trusted, and you should always design your application to fail gracefully if a particular

header is empty or doesn’t exist.) You should either use a try/except clause or the get() method to handle the

case of undeﬁned keys:

# BAD!

def ua_display_bad(request):

ua =request.META[’HTTP_USER_AGENT’]# Might raise KeyError!

return HttpResponse("Your browser is %s"%ua)

# GOOD (VERSION 1)

def ua_display_good1(request):

try:

ua =request.META[’HTTP_USER_AGENT’]

except KeyError:

ua =’unknown’

return HttpResponse("Your browser is %s"%ua)

# GOOD (VERSION 2)

def ua_display_good2(request):

ua =request.META.get(’HTTP_USER_AGENT’,’unknown’)

return HttpResponse("Your browser is %s"%ua)

We encourage you to write a small view that displays all of the request.META data so you can get to know what’s

in there. Here’s what that view might look like:

def display_meta(request):

values =request.META.items()

values.sort()

html =[]

for k, v in values:

106 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

html.append(’<tr><td>%s</td><td>%s</td></tr>’ %(k, v))

return HttpResponse(’<table>%s</table>’ %’\n’.join(html))

As an exercise, see whether you can convert this view to use Django’s template system instead of hard-coding the

HTML. Also try adding request.path and the other HttpRequest methods from the previous section.

9.1.3 Information About Submitted Data

Beyond basic metadata about the request, HttpRequest objects have two attributes that contain information sub-

mitted by the user: request.GET and request.POST. Both of these are dictionary-like objects that give you

access to GET and POST data.

Dictionary-like objects

When we say request.GET and request.POST are “dictionary-like” objects, we mean that they behave like

standard Python dictionaries but aren’t technically dictionaries under the hood. For example, request.GET and

request.POST both have get(),keys() and values() methods, and you can iterate over the keys by doing

for key in request.GET.

So why the distinction? Because both request.GET and request.POST have additional methods that normal

dictionaries don’t have. We’ll get into these in a short while.

You might have encountered the similar term “ﬁle-like objects” – Python objects that have a few basic methods, like

read(), that let them act as stand-ins for “real” ﬁle objects.

POST data generally is submitted from an HTML <form>, while GET data can come from a <form> or the query

string in the page’s URL.

9.2 A Simple Form-Handling Example

Continuing this book’s ongoing example of books, authors and publishers, let’s create a simple view that lets users

search our book database by title.

Generally, there are two parts to developing a form: the HTML user interface and the backend view code that processes

the submitted data. The ﬁrst part is easy; let’s just set up a view that displays a search form:

from django.shortcuts import render

def search_form(request):

return render(request, ’search_form.html’)

As we learned in Chapter 3, this view can live anywhere on your Python path. For sake of argument, put it in

books/views.py.

The accompanying template, search_form.html, could look like this:

<html>

<head>

<title>Search</title>

</head>

<body>

</form>

9.2. A Simple Form-Handling Example 107

The Django Book, Release 2.0

</body>

</html>

The URLpattern in urls.py could look like this:

from mysite.books import views

urlpatterns =patterns(’’,

# ...

url(r’^search-form/$’, views.search_form),

# ...

)

(Note that we’re importing the views module directly, instead of something like from mysite.views import

search_form, because the former is less verbose. We’ll cover this importing approach in more detail in Chapter 8.)

Now, if you run the runserver and visit http://127.0.0.1:8000/search-form/, you’ll see the search

interface. Simple enough.

Try submitting the form, though, and you’ll get a Django 404 error. The form points to the URL /search/, which

hasn’t yet been implemented. Let’s ﬁx that with a second view function:

# urls.py

urlpatterns =patterns(’’,

# ...

(r’^search-form/$’, views.search_form),

(r’^search/$’, views.search),

# ...

)

# views.py

def search(request):

if ’q’ in request.GET:

message =’You searched for: %r’%request.GET[’q’]

else:

message =’You submitted an empty form.’

return HttpResponse(message)

For the moment, this merely displays the user’s search term, so we can make sure the data is being submitted to Django

properly, and so you can get a feel for how the search term ﬂows through the system. In short:

1. The HTML <form> deﬁnes a variable q. When it’s submitted, the value of qis sent via GET

(method="get") to the URL /search/.

2. The Django view that handles the URL /search/ (search()) has access to the qvalue in request.GET.

An important thing to point out here is that we explicitly check that ’q’ exists in request.GET. As we pointed out

in the request.META section above, you shouldn’t trust anything submitted by users or even assume that they’ve

submitted anything in the ﬁrst place. If we didn’t add this check, any submission of an empty form would raise

KeyError in the view:

# BAD!

def bad_search(request):

# The following line will raise KeyError if ’q’ hasn’t

# been submitted!

message =’You searched for: %r’%request.GET[’q’]

return HttpResponse(message)

108 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

Query string parameters

Because GET data is passed in the query string (e.g., /search/?q=django), you can use request.GET to access

query string variables. In Chapter 3’s introduction of Django’s URLconf system, we compared Django’s pretty URLs

to more traditional PHP/Java URLs such as /time/plus?hours=3 and said we’d show you how to do the latter

in Chapter 7. Now you know how to access query string parameters in your views (like hours=3 in this example) –

use request.GET.

POST data works the same way as GET data – just use request.POST instead of request.GET. What’s the

difference between GET and POST? Use GET when the act of submitting the form is just a request to “get” data. Use

POST whenever the act of submitting the form will have some side effect – changing data, or sending an e-mail, or

something else that’s beyond simple display of data. In our book-search example, we’re using GET because the query

doesn’t change any data on our server. (See http://www.w3.org/2001/tag/doc/whenToUseGet.html if you want to learn

more about GET and POST.)

Now that we’ve veriﬁed request.GET is being passed in properly, let’s hook the user’s search query into our book

database (again, in views.py):

from django.http import HttpResponse

from django.shortcuts import render

from mysite.books.models import Book

def search(request):

if ’q’ in request.GET and request.GET[’q’]:

q=request.GET[’q’]

books =Book.objects.filter(title__icontains=q)

return render(request, ’search_results.html’,

{’books’: books, ’query’: q})

else:

return HttpResponse(’Please submit a search term.’)

A couple of notes on what we did here:

• Aside from checking that ’q’ exists in request.GET, we also make sure that request.GET[’q’] is a

non-empty value before passing it to the database query.

• We’re using Book.objects.filter(title__icontains=q) to query our book table for all books

whose title includes the given submission. The icontains is a lookup type (as explained in Chapter 5 and

Appendix B), and the statement can be roughly translated as “Get the books whose title contains q, without

being case-sensitive.”

This is a very simple way to do a book search. We wouldn’t recommend using a simple icontains query on

a large production database, as it can be slow. (In the real world, you’d want to use a custom search system of

some sort. Search the Web for open-source full-text search to get an idea of the possibilities.)

• We pass books, a list of Book objects, to the template. The template code for search_results.html

might include something like this:

<p>You searched for: <strong>{{ query }}</strong></p>

{% if books %}

<p>Found {{ books|length }} book{{ books|pluralize }}.</p>

<ul>

{% for book in books %}

<li>{{ book.title }}</li>

{% endfor %}

</ul>

{% else %}

9.2. A Simple Form-Handling Example 109

The Django Book, Release 2.0

<p>No books matched your search criteria.</p>

{% endif %}

Note usage of the pluralize template ﬁlter, which outputs an “s” if appropriate, based on the number of

books found.

9.3 Improving Our Simple Form-Handling Example

As in previous chapters, we’ve shown you the simplest thing that could possibly work. Now we’ll point out some

problems and show you how to improve it.

First, our search() view’s handling of an empty query is poor – we’re just displaying a "Please submit a

search term." message, requiring the user to hit the browser’s back button. This is horrid and unprofessional,

and if you ever actually implement something like this in the wild, your Django privileges will be revoked.

It would be much better to redisplay the form, with an error above it, so that the user can try again immediately. The

easiest way to do that would be to render the template again, like this:

from django.http import HttpResponse

from django.shortcuts import render

from mysite.books.models import Book

def search_form(request):

return render(request, ‘search_form.html’)

def search(request):

if ‘q’ in request.GET and request.GET[’q’]:

q = request.GET[’q’]

books = Book.objects.filter(title__icontains=q)

return render(request, ‘search_results.html’,

{‘books’: books, ‘query’: q})

else:

return render(request, ‘search_form.html’, {‘error’: True})

(Note that we’ve included search_form() here so you can see both views in one place.)

Here, we’ve improved search() to render the search_form.html template again, if the query is empty. And

because we need to display an error message in that template, we pass a template variable. Now we can edit

search_form.html to check for the error variable:

<html>

<head>

<title>Search</title>

</head>

<body>

{% if error %}

<p style=’’color: red;’’>Please submit a search term.</p>

{% endif %}

</form>

</body>

</html>

110 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

We can still use this template from our original view, search_form(), because search_form() doesn’t pass

error to the template – so the error message won’t show up in that case.

With this change in place, it’s a better application, but it now begs the question: is a dedicated search_form()

view really necessary? As it stands, a request to the URL /search/ (without any GET parameters) will display the

empty form (but with an error). We can remove the search_form() view, along with its associated URLpattern, as

long as we change search() to hide the error message when somebody visits /search/ with no GET parameters:

def search(request):

error =False

if ’q’ in request.GET:

q=request.GET[’q’]

if not q:

error =True

else:

books =Book.objects.filter(title__icontains=q)

return render(request, ’search_results.html’,

{’books’: books, ’query’: q})

return render(request, ’search_form.html’,

{’error’: error})

In this updated view, if a user visits /search/ with no GET parameters, he’ll see the search form with no error

message. If a user submits the form with an empty value for ’q’, he’ll see the search form with an error message.

And, ﬁnally, if a user submits the form with a non-empty value for ’q’, he’ll see the search results.

We can make one ﬁnal improvement to this application, to remove a bit of redundancy. Now that we’ve rolled the two

views and URLs into one and /search/ handles both search-form display and result display, the HTML <form>

in search_form.html doesn’t have to hard-code a URL. Instead of this:

It can be changed to this:

The action="" means “Submit the form to the same URL as the current page.” With this change in place, you

won’t have to remember to change the action if you ever hook the search() view to another URL.

9.4 Simple validation

Our search example is still reasonably simple, particularly in terms of its data validation; we’re merely checking to

make sure the search query isn’t empty. Many HTML forms include a level of validation that’s more complex than

making sure the value is non-empty. We’ve all seen the error messages on Web sites:

• “Please enter a valid e-mail address. ‘foo’ is not an e-mail address.”

• “Please enter a valid ﬁve-digit U.S. ZIP code. ‘123’ is not a ZIP code.”

• “Please enter a valid date in the format YYYY-MM-DD.”

• “Please enter a password that is at least 8 characters long and contains at least one number.”

A note on JavaScript validation

This is beyond the scope of this book, but you can use JavaScript to validate data on the client side, directly in the

browser. But be warned: even if you do this, you must validate data on the server side, too. Some people have

JavaScript turned off, and some malicious users might submit raw, unvalidated data directly to your form handler to

see whether they can cause mischief.

9.4. Simple validation 111

The Django Book, Release 2.0

There’s nothing you can do about this, other than always validate user-submitted data server-side (i.e., in your Django

views). You should think of JavaScript validation as a bonus usability feature, not as your only means of validating.

Let’s tweak our search() view so that it validates that the search term is less than or equal to 20 characters long.

(For sake of example, let’s say anything longer than that might make the query too slow.) How might we do that? The

simplest possible thing would be to embed the logic directly in the view, like this:

def search(request):

error = False

if ‘q’ in request.GET:

q = request.GET[’q’]

if not q:

error = True

elif len(q) > 20:

error = True

else:

books = Book.objects.filter(title__icontains=q)

return render(request, ‘search_results.html’,

{‘books’: books, ‘query’: q})

return render(request, ‘search_form.html’,

{‘error’: error})

Now, if you try submitting a search query greater than 20 characters long, it won’t let you search; you’ll get an

error message. But that error message in search_form.html currently says "Please submit a search

term." – so we’ll have to change it to be accurate for both cases:

<html>

<head>

<title>Search</title>

</head>

<body>

{% if error %}

<p style="color: red;">Please submit a search term 20 characters or shorter.</p>

{% endif %}

</form>

</body>

</html>

There’s something ugly about this. Our one-size-ﬁts-all error message is potentially confusing. Why should the

error message for an empty form submission mention anything about a 20-character limit? Error messages should be

speciﬁc, unambiguous and not confusing.

The problem is in the fact that we’re using a simple boolean value for error, whereas we should be using a list of

error message strings. Here’s how we might ﬁx that:

def search(request):

errors = []

if ‘q’ in request.GET:

q = request.GET[’q’]

if not q:

errors.append(‘Enter a search term.’)

elif len(q) > 20:

errors.append(‘Please enter at most 20 characters.’)

else:

112 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

books = Book.objects.filter(title__icontains=q)

return render(request, ‘search_results.html’,

{‘books’: books, ‘query’: q})

return render(request, ‘search_form.html’,

{‘errors’: errors})

Then, we need make a small tweak to the search_form.html template to reﬂect that it’s now passed an errors

list instead of an error boolean value:

<html>

<head>

<title>Search</title>

</head>

<body>

{% if errors %}

<ul>

{% for error in errors %}

<li>{{ error }}</li>

{% endfor %}

</ul>

{% endif %}

</form>

</body>

</html>

9.5 Making a Contact Form

Although we iterated over the book search form example several times and improved it nicely, it’s still fundamentally

simple: just a single ﬁeld, ’q’. Because it’s so simple, we didn’t even use Django’s form library to deal with it.

But more complex forms call for more complex treatment – and now we’ll develop something more complex: a site

contact form.

This will be a form that lets site users submit a bit of feedback, along with an optional e-mail return address. After the

form is submitted and the data is validated, we’ll automatically send the message via e-mail to the site staff.

We’ll start with our template, contact_form.html.

<html>

<head>

<title>Contact us</title>

</head>

<body>

<h1>Contact us</h1>

{% if errors %}

<ul>

{% for error in errors %}

<li>{{ error }}</li>

{% endfor %}

</ul>

{% endif %}

9.5. Making a Contact Form 113

The Django Book, Release 2.0

<p>Subject: <input type="text" name="subject"></p>

<p>Your e-mail (optional): <input type="text" name="email"></p>

<p>Message: <textarea name="message" rows="10" cols="50"></textarea></p>

</form>

</body>

</html>

We’ve deﬁned three ﬁelds: the subject, e-mail address and message. The second is optional, but the other two

ﬁelds are required. Note we’re using method="post" here instead of method="get" because this form sub-

mission has a side effect – it sends an e-mail. Also, we copied the error-displaying code from our previous template

search_form.html.

If we continue down the road established by our search() view from the previous section, a naive version of our

contact() view might look like this:

from django.core.mail import send_mail

from django.http import HttpResponseRedirect

from django.shortcuts import render

def contact(request):

errors =[]

if request.method == ’POST’:

if not request.POST.get(’subject’,’’):

errors.append(’Enter a subject.’)

if not request.POST.get(’message’,’’):

errors.append(’Enter a message.’)

if request.POST.get(’email’)and ’@’ not in request.POST[’email’]:

errors.append(’Enter a valid e-mail address.’)

if not errors:

send_mail(

request.POST[’subject’],

request.POST[’message’],

request.POST.get(’email’,’noreply@example.com’),

[’siteowner@example.com’],

)

return HttpResponseRedirect(’/contact/thanks/’)

return render(request, ’contact_form.html’,

{’errors’: errors})

(If you’re following along, you may be wondering whether to put this view in the books/views.py ﬁle. It doesn’t

have anything to do with the books application, so should it live elsewhere? It’s totally up to you; Django doesn’t care,

as long as you’re able to point to the view from your URLconf. Our personal preference would be to create a separate

directory, contact, at the same level in the directory tree as books. This would contain an empty __init__.py

and views.py.)

A couple of new things are happening here:

• We’re checking that request.method is ’POST’. This will only be true in the case of a form submission; it

won’t be true if somebody is merely viewing the contact form. (In the latter case, request.method will

be set to ’GET’, because in normal Web browsing, browsers use GET, not POST.) This makes it a nice

way to isolate the “form display” case from the “form processing” case.

• Instead of request.GET, we’re using request.POST to access the submitted form data. This is necessary

because the HTML <form> in contact_form.html uses method="post". If this view is accessed via

POST, then request.GET will be empty.

• This time, we have two required ﬁelds, subject and message, so we have to validate both. Note that we’re

using request.POST.get() and providing a blank string as the default value; this is a nice, short way of

114 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

handling both the cases of missing keys and missing data.

• Although the email ﬁeld is not required, we still validate it if it is indeed submitted. Our validation algorithm

here is fragile – we’re just checking that the string contains an @character. In the real world, you’d want more

robust validation (and Django provides it, which we’ll show you very shortly).

• We’re using the function django.core.mail.send_mail to send an e-mail. This function has four re-

quired arguments: the e-mail subject, the e-mail body, the “from” address, and a list of recipient addresses.

send_mail is a convenient wrapper around Django’s EmailMessage class, which provides advanced fea-

tures such as attachments, multipart e-mails, and full control over e-mail headers.

Note that in order to send e-mail using send_mail(), your server must be conﬁgured to send mail, and

Django must be told about your outbound e-mail server. See http://docs.djangoproject.com/en/dev/topics/email/

for the speciﬁcs.

• After the e-mail is sent, we redirect to a “success” page by returning an HttpResponseRedirect object.

We’ll leave the implementation of that “success” page up to you (it’s a simple view/URLconf/template), but we

should explain why we initiate a redirect instead of, for example, simply calling render() with a template

right there.

The reason: if a user hits “Refresh” on a page that was loaded via POST, that request will be repeated. This can

often lead to undesired behavior, such as a duplicate record being added to the database – or, in our example,

the e-mail being sent twice. If the user is redirected to another page after the POST, then there’s no chance of

repeating the request.

You should always issue a redirect for successful POST requests. It’s a Web development best practice.

This view works, but those validation functions are kind of crufty. Imagine processing a form with a dozen ﬁelds;

would you really want to have to write all of those if statements?

Another problem is form redisplay. In the case of validation errors, it’s best practice to redisplay the form with the

previously submitted data already ﬁlled in, so the user can see what he did wrong (and also so the user doesn’t have to

reenter data in ﬁelds that were submitted correctly). We could manually pass the POST data back to the template, but

we’d have to edit each HTML ﬁeld to insert the proper value in the proper place:

# views.py

def contact(request):

errors = []

if request.method == ‘POST’:

if not request.POST.get(‘subject’, ‘’):

errors.append(‘Enter a subject.’)

if not request.POST.get(‘message’, ‘’):

errors.append(‘Enter a message.’)

if request.POST.get(‘email’) and ‘@’ not in request.POST[’email’]:

errors.append(‘Enter a valid e-mail address.’)

if not errors:

send_mail(

request.POST[’subject’],

request.POST[’message’],

request.POST.get(‘email’, ‘noreply@example.com‘),

[‘siteowner@example.com‘],

)

return HttpResponseRedirect(‘/contact/thanks/’)

return render(request, ‘contact_form.html’, {

‘errors’: errors,

‘subject’: request.POST.get(‘subject’, ‘’),

‘message’: request.POST.get(‘message’, ‘’),

9.5. Making a Contact Form 115

The Django Book, Release 2.0

‘email’: request.POST.get(‘email’, ‘’),

})

# contact_form.html

<html>

<head>

<title>Contact us</title>

</head>

<body>

<h1>Contact us</h1>

{% if errors %}

<ul>

{% for error in errors %}

<li>{{ error }}</li>

{% endfor %}

</ul>

{% endif %}

<p>Subject: <input type=’’text’’ name=’’subject’’ value=’’{{ subject }}’’></p>

<p>Your e-mail (optional): <input type=’’text’’ name=’’email’’ value=’’{{ email }}’’></p>

<p>Message: <textarea name=’’message’’ rows=’’10’’ cols=’’50’’>**{{ message }}**</textarea></p>

</form>

</body>

</html>

This is a lot of cruft, and it introduces a lot of opportunities for human error. We hope you’re starting to see the

opportunity for some higher-level library that handles form- and validation-related tasks.

9.6 Your First Form Class

Django comes with a form library, called django.forms, that handles many of the issues we’ve been exploring

this chapter – from HTML form display to validation. Let’s dive in and rework our contact form application using the

Django forms framework.

Django’s “newforms” library

Throughout the Django community, you might see chatter about something called django.newforms. When

people speak of django.newforms, they’re talking about what is now django.forms – the library covered by

this chapter.

The reason for this name change is historic. When Django was ﬁrst released to the public, it had a compli-

cated, confusing forms system, django.forms. It was completely rewritten, and the new version was called

django.newforms so that people could still use the old system. When Django 1.0 was released, the old

django.forms went away, and django.newforms became django.forms.

The primary way to use the forms framework is to deﬁne a Form class for each HTML <form> you’re dealing

with. In our case, we only have one <form>, so we’ll have one Form class. This class can live anywhere you want

– including directly in your views.py ﬁle – but community convention is to keep Form classes in a separate ﬁle

called forms.py. Create this ﬁle in the same directory as your views.py, and enter the following:

116 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

from django import forms

class ContactForm(forms.Form):

subject =forms.CharField()

email =forms.EmailField(required=False)

message =forms.CharField()

This is pretty intuitive, and it’s similar to Django’s model syntax. Each ﬁeld in the form is represented by a type of

Field class – CharField and EmailField are the only types of ﬁelds used here – as attributes of a Form class.

Each ﬁeld is required by default, so to make email optional, we specify required=False.

Let’s hop into the Python interactive interpreter and see what this class can do. The ﬁrst thing it can do is display itself

as HTML:

>>> from contact.forms import ContactForm

>>> f=ContactForm()

>>> print f

<tr><th><label for="id_subject">Subject:</label></th><td><input type="text" name="subject" id="id_subject" /></td></tr>

<tr><th><label for="id_email">Email:</label></th><td><input type="text" name="email" id="id_email" /></td></tr>

<tr><th><label for="id_message">Message:</label></th><td><input type="text" name="message" id="id_message" /></td></tr>

Django adds a label to each ﬁeld, along with <label> tags for accessibility. The idea is to make the default behavior

as optimal as possible.

This default output is in the format of an HTML <table>, but there are a few other built-in outputs:

>>> print f.as_ul()

<li><label for="id_subject">Subject:</label> <input type="text" name="subject" id="id_subject" /></li>

<li><label for="id_email">Email:</label> <input type="text" name="email" id="id_email" /></li>

<li><label for="id_message">Message:</label> <input type="text" name="message" id="id_message" /></li>

>>> print f.as_p()

<p><label for="id_subject">Subject:</label> <input type="text" name="subject" id="id_subject" /></p>

<p><label for="id_email">Email:</label> <input type="text" name="email" id="id_email" /></p>

<p><label for="id_message">Message:</label> <input type="text" name="message" id="id_message" /></p>

Note that the opening and closing <table>,<ul> and <form> tags aren’t included in the output, so that you can

add any additional rows and customization if necessary.

These methods are just shortcuts for the common case of “display the entire form.” You can also display the HTML

for a particular ﬁeld:

>>> print f[’subject’]

>>> print f[’message’]

The second thing Form objects can do is validate data. To validate data, create a new Form object and pass it a

dictionary of data that maps ﬁeld names to data:

>>> f=ContactForm({’subject’:’Hello’,’email’:’adrian@example.com’,’message’:’Nice site!’})

Once you’ve associated data with a Form instance, you’ve created a “bound” form:

>>> f.is_bound

True

Call the is_valid() method on any bound Form to ﬁnd out whether its data is valid. We’ve passed a valid value

for each ﬁeld, so the Form in its entirety is valid:

9.6. Your First Form Class 117

The Django Book, Release 2.0

>>> f.is_valid()

True

If we don’t pass the email ﬁeld, it’s still valid, because we’ve speciﬁed required=False for that ﬁeld:

>>> f=ContactForm({’subject’:’Hello’,’message’:’Nice site!’})

>>> f.is_valid()

True

But, if we leave off either subject or message, the Form is no longer valid:

>>> f=ContactForm({’subject’:’Hello’})

>>> f.is_valid()

False

>>> f=ContactForm({’subject’:’Hello’,’message’:’’})

>>> f.is_valid()

False

You can drill down to get ﬁeld-speciﬁc error messages:

>>> f=ContactForm({’subject’:’Hello’,’message’:’’})

>>> f[’message’].errors

[u’This field is required.’]

>>> f[’subject’].errors

[]

>>> f[’email’].errors

[]

Each bound Form instance has an errors attribute that gives you a dictionary mapping ﬁeld names to error-message

lists:

>>> f=ContactForm({’subject’:’Hello’,’message’:’’})

>>> f.errors

{’message’: [u’This field is required.’]}

Finally, for Form instances whose data has been found to be valid, a cleaned_data attribute is available. This is

a dictionary of the submitted data, “cleaned up.” Django’s forms framework not only validates data, it cleans it up by

converting values to the appropriate Python types.

>>> f=ContactForm({’subject’:’Hello’,’email’:’adrian@example.com’,’message’:’Nice site!’})

>>> f.is_valid()

True

>>> f.cleaned_data

{’message’: u’Nice site!’, ’email’: u’adrian@example.com’, ’subject’: u’Hello’}

Our contact form only deals with strings, which are “cleaned” into Unicode objects – but if we were to use an

IntegerField or DateField, the forms framework would ensure that cleaned_data used proper Python

integers or datetime.date objects for the given ﬁelds.

9.7 Tying Form Objects Into Views

With some basic knowledge about Form classes, you might see how we can use this infrastructure to replace some of

the cruft in our contact() view. Here’s how we can rewrite contact() to use the forms framework:

# views.py

from django.shortcuts import render

from mysite.contact.forms import ContactForm

118 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

from django.http import HttpResponseRedirect

from django.core.mail import send_mail

def contact(request):

if request.method == ’POST’:

form = ContactForm(request.POST)

if form.is_valid():

cd = form.cleaned_data

send_mail(

cd[’subject’],

cd[’message’],

cd.get(’email’, ’noreply@example.com’),

[’siteowner@example.com’],

)

return HttpResponseRedirect(’/contact/thanks/’)

else:

form = ContactForm()

return render(request, ’contact_form.html’, {’form’: form})

# contact_form.html

<html>

<head>

<title>Contact us</title>

</head>

<body>

<h1>Contact us</h1>

{% if form.errors %}

Please correct the error{{ form.errors|pluralize }} below.

</p>

{% endif %}

<table>

</table>

{% csrf_token %}

</form>

</body>

</html>

Look at how much cruft we’ve been able to remove! Django’s forms framework handles the HTML display, the

validation, data cleanup and form redisplay-with-errors.

Since we’re creating a POST form (which can have the effect of modifying data), we need to worry about Cross Site Re-

quest Forgeries. Thankfully, you don’t have to worry too hard, because Django comes with a very easy-to-use system

for protecting against it. In short, all POST forms that are targeted at internal URLs should use the {% csrf_token

%} template tag. More details about {% csrf_token %} can be found in Chapter 16: django.contrib and Chapter

20: Security.

Try running this locally. Load the form, submit it with none of the ﬁelds ﬁlled out, submit it with an invalid e-mail

address, then ﬁnally submit it with valid data. (Of course, depending on your mail-server conﬁguration, you might get

an error when send_mail() is called, but that’s another issue.)

9.7. Tying Form Objects Into Views 119

The Django Book, Release 2.0

9.8 Changing How Fields Are Rendered

Probably the ﬁrst thing you’ll notice when you render this form locally is that the message ﬁeld is displayed as an

<input type="text">, and it ought to be a <textarea>. We can ﬁx that by setting the ﬁeld’s widget:

from django import forms

class ContactForm(forms.Form):

subject = forms.CharField()

email = forms.EmailField(required=False)

message = forms.CharField(widget=forms.Textarea)

The forms framework separates out the presentation logic for each ﬁeld into a set of widgets. Each ﬁeld type has a

default widget, but you can easily override the default, or provide a custom widget of your own.

Think of the Field classes as representing validation logic, while widgets represent presentation logic.

9.9 Setting a Maximum Length

One of the most common validation needs is to check that a ﬁeld is of a certain size. For good measure, we should

improve our ContactForm to limit the subject to 100 characters. To do that, just supply a max_length to the

CharField, like this:

from django import forms

class ContactForm(forms.Form):

subject = forms.CharField(max_length=100)

email = forms.EmailField(required=False)

message = forms.CharField(widget=forms.Textarea)

An optional min_length argument is also available.

9.10 Setting Initial Values

As an improvement to this form, let’s add an initial value for the subject ﬁeld: "I love your site!" (A little

power of suggestion can’t hurt.) To do this, we can use the initial argument when we create a Form instance:

def contact(request):

if request.method == ‘POST’:

form = ContactForm(request.POST)

if form.is_valid():

cd = form.cleaned_data

send_mail(

cd[’subject’],

cd[’message’],

cd.get(‘email’, ‘noreply@example.com‘),

[‘siteowner@example.com‘],

)

return HttpResponseRedirect(‘/contact/thanks/’)

else:

form = ContactForm(

initial={‘subject’: ‘I love your site!’}

)

120 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

return render(request, ‘contact_form.html’, {‘form’: form})

Now, the subject ﬁeld will be displayed prepopulated with that kind statement.

Note that there is a difference between passing initial data and passing data that binds the form. The biggest difference

is that if you’re just passing initial data, then the form will be unbound, which means it won’t have any error messages.

9.11 Custom Validation Rules

Imagine we’ve launched our feedback form, and the e-mails have started tumbling in. There’s just one problem: some

of the submitted messages are just one or two words, which isn’t long enough for us to make sense of. We decide to

adopt a new validation policy: four words or more, please.

There are a number of ways to hook custom validation into a Django form. If our rule is something we will reuse

again and again, we can create a custom ﬁeld type. Most custom validations are one-off affairs, though, and can be

tied directly to the Form class.

We want additional validation on the message ﬁeld, so we add a clean_message() method to our Form class:

from django import forms

class ContactForm(forms.Form):

subject =forms.CharField(max_length=100)

email =forms.EmailField(required=False)

message =forms.CharField(widget=forms.Textarea)

def clean_message(self):

message =self.cleaned_data[’message’]

num_words =len(message.split())

if num_words <4:

raise forms.ValidationError("Not enough words!")

return message

Django’s form system automatically looks for any method whose name starts with clean_ and ends with the name

of a ﬁeld. If any such method exists, it’s called during validation.

Speciﬁcally, the clean_message() method will be called after the default validation logic for a given ﬁeld (in this

case, the validation logic for a required CharField). Because the ﬁeld data has already been partially processed,

we pull it out of self.cleaned_data. Also, we don’t have to worry about checking that the value exists and is

non-empty; that’s done by the default validator.

We naively use a combination of len() and split() to count the number of words. If the user has entered too few

words, we raise a forms.ValidationError. The string attached to this exception will be displayed to the user

as an item in the error list.

It’s important that we explicitly return the cleaned value for the ﬁeld at the end of the method. This allows us to

modify the value (or convert it to a different Python type) within our custom validation method. If we forget the return

statement, then None will be returned, and the original value will be lost.

9.12 Specifying labels

By default, the labels on Django’s auto-generated form HTML are created by replacing underscores with spaces

and capitalizing the ﬁrst letter – so the label for the email ﬁeld is "Email". (Sound familiar? It’s the same simple

algorithm that Django’s models use to calculate default verbose_name values for ﬁelds. We covered this in Chapter

5.)

9.11. Custom Validation Rules 121

The Django Book, Release 2.0

But, as with Django’s models, we can customize the label for a given ﬁeld. Just use label, like so:

class ContactForm(forms.Form):

subject = forms.CharField(max_length=100)

email = forms.EmailField(required=False, label=’Your e-mail address’)

message = forms.CharField(widget=forms.Textarea)

9.13 Customizing Form Design

Our contact_form.html template uses {{ form.as_table }} to display the form, but we can display the

form in other ways to get more granular control over display.

The quickest way to customize forms’ presentation is with CSS. Error lists, in particular, could do with some visual

enhancement, and the auto-generated error lists use <ul class="errorlist"> precisely so that you can target

them with CSS. The following CSS really makes our errors stand out:

ul.errorlist {

margin: 0;

padding: 0;

}

.errorlist li {

background-color: red;

color: white;

display: block;

font-size: 10px;

margin: 0 0 3px;

padding: 4px 5px;

}

</style>

While it’s convenient to have our form’s HTML generated for us, in many cases you’ll want to override the default

rendering. {{ form.as_table }} and friends are useful shortcuts while you develop your application, but ev-

erything about the way a form is displayed can be overridden, mostly within the template itself, and you’ll probably

ﬁnd yourself doing this.

Each ﬁeld’s widget (<input type="text">,<select>,<textarea>, etc.) can be rendered individually by

accessing {{ form.fieldname }} in the template, and any errors associated with a ﬁeld are available as {{

form.fieldname.errors }}. With this in mind, we can construct a custom template for our contact form with

the following template code:

<html>

<head>

<title>Contact us</title>

</head>

<body>

<h1>Contact us</h1>

{% if form.errors %}

Please correct the error{{ form.errors|pluralize }} below.

</p>

{% endif %}

122 Chapter 9. Chapter 7: Forms

The Django Book, Release 2.0

<label for="id_subject">Subject:</label>

</div>

<label for="id_email">Your e-mail address:</label>

</div>

<label for="id_message">Message:</label>

</div>

</form>

</body>

</html>

{{ form.message.errors }} displays a <ul class="errorlist"> if errors are present and a blank

string if the ﬁeld is valid (or the form is unbound). We can also treat form.message.errors as a Boolean or even

iterate over it as a list. For example:

{% if form.message.errors %}

<ul>

{% for error in form.message.errors %}

<li><strong>{{ error }}</strong></li>

{% endfor %}

</ul>

{% endif %}

<label for="id_message">Message:</label>

</div>

In the case of validation errors, this will add an “errors” class to the containing <div> and display the list of errors in

an unordered list.

9.14 What’s Next?

This chapter concludes the introductory material in this book – the so-called “core curriculum.” The next section of

the book, Chapters 8 to 12, goes into more detail about advanced Django usage, including how to deploy a Django

application (Chapter 12).

After these ﬁrst seven chapters, you should know enough to start writing your own Django projects. The rest of the

material in this book will help ﬁll in the missing pieces as you need them.

We’ll start in Chapter 8, by doubling back and taking a closer look at views and URLconfs (introduced ﬁrst in chap-

ter03).

9.14. What’s Next? 123

The Django Book, Release 2.0

124 Chapter 9. Chapter 7: Forms

CHAPTER 10

Chapter 8: Advanced Views and URLconfs

In Chapter 3, we explained the basics of Django view functions and URLconfs. This chapter goes into more detail

about advanced functionality in those two pieces of the framework.

10.1 URLconf Tricks

There’s nothing “special” about URLconfs – like anything else in Django, they’re just Python code. You can take

advantage of this in several ways, as described in the sections that follow.

10.1.1 Streamlining Function Imports

Consider this URLconf, which builds on the example in Chapter 3:

from django.conf.urls.defaults import *

from mysite.views import hello, current_datetime, hours_ahead

urlpatterns =patterns(’’,

(r’^hello/$’, hello),

(r’^time/$’, current_datetime),

(r’^time/plus/(\d{1,2})/$’, hours_ahead),

)

As explained in Chapter 3, each entry in the URLconf includes its associated view function, passed directly as a

function object. This means it’s necessary to import the view functions at the top of the module.

But as a Django application grows in complexity, its URLconf grows, too, and keeping those imports can be tedious to

manage. (For each new view function, you have to remember to import it, and the import statement tends to get overly

long if you use this approach.) It’s possible to avoid that tedium by importing the views module itself. This example

URLconf is equivalent to the previous one:

from django.conf.urls.defaults import *

from mysite import views

urlpatterns = patterns(‘’,

(r’^hello/$’, views.hello),

(r’^time/$’, views.current_datetime),

(r’^time/plus/(d{1,2})/$’, views.hours_ahead),

)

125

The Django Book, Release 2.0

Django offers another way of specifying the view function for a particular pattern in the URLconf: you can pass a

string containing the module name and function name rather than the function object itself. Continuing the ongoing

example:

from django.conf.urls.defaults import *

urlpatterns = patterns(‘’,

(r’^hello/$’, ‘mysite.views.hello’),

(r’^time/$’, ‘mysite.views.current_datetime’),

(r’^time/plus/(d{1,2})/$’, ‘mysite.views.hours_ahead’),

)

(Note the quotes around the view names. We’re using ’mysite.views.current_datetime’ – with quotes –

instead of mysite.views.current_datetime.)

Using this technique, it’s no longer necessary to import the view functions; Django automatically imports the ap-

propriate view function the ﬁrst time it’s needed, according to the string describing the name and path of the view

function.

A further shortcut you can take when using the string technique is to factor out a common “view preﬁx.” In our

URLconf example, each of the view strings starts with ’mysite.views’, which is redundant to type. We can

factor out that common preﬁx and pass it as the ﬁrst argument to patterns(), like this:

from django.conf.urls.defaults import *

urlpatterns = patterns(‘mysite.views’,

(r’^hello/$’, ‘hello’),

(r’^time/$’, ‘current_datetime’),

(r’^time/plus/(d{1,2})/$’, ‘hours_ahead’),

)

Note that you don’t put a trailing dot (".") in the preﬁx, nor do you put a leading dot in the view strings. Django puts

those in automatically.

With these two approaches in mind, which is better? It really depends on your personal coding style and needs.

Advantages of the string approach are as follows:

• It’s more compact, because it doesn’t require you to import the view functions.

• It results in more readable and manageable URLconfs if your view functions are spread across several different

Python modules.

Advantages of the function object approach are as follows:

• It allows for easy “wrapping” of view functions. See the section “Wrapping View Functions” later in this

chapter.

• It’s more “Pythonic” – that is, it’s more in line with Python traditions, such as passing functions as objects.

Both approaches are valid, and you can even mix them within the same URLconf. The choice is yours.

10.1.2 Using Multiple View Preﬁxes

In practice, if you use the string technique, you’ll probably end up mixing views to the point where the views in your

URLconf won’t have a common preﬁx. However, you can still take advantage of the view preﬁx shortcut to remove

duplication. Just add multiple patterns() objects together, like this:

Old:

126 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^hello/$’,’mysite.views.hello’),

(r’^time/$’,’mysite.views.current_datetime’),

(r’^time/plus/(\d{1,2})/$’,’mysite.views.hours_ahead’),

(r’^tag/(\w+)/$’,’weblog.views.tag’),

)

New:

from django.conf.urls.defaults import *

urlpatterns =patterns(’mysite.views’,

(r’^hello/$’,’hello’),

(r’^time/$’,’current_datetime’),

(r’^time/plus/(\d{1,2})/$’,’hours_ahead’),

)

urlpatterns += patterns(’weblog.views’,

(r’^tag/(\w+)/$’,’tag’),

)

All the framework cares about is that there’s a module-level variable called urlpatterns. This variable can be

constructed dynamically, as we do in this example. We should speciﬁcally point out that the objects returned by

patterns() can be added together, which is something you might not have expected.

10.1.3 Special-Casing URLs in Debug Mode

Speaking of constructing urlpatterns dynamically, you might want to take advantage of this technique to alter

your URLconf’s behavior while in Django’s debug mode. To do this, just check the value of the DEBUG setting at

runtime, like so:

from django.conf import settings

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^$’, views.homepage),

(r’^(\d{4})/([a-z]{3})/$’, views.archive_month),

)

if settings.DEBUG:

urlpatterns += patterns(’’,

(r’^debuginfo/$’, views.debug),

)

In this example, the URL /debuginfo/ will only be available if your DEBUG setting is set to True.

10.1.4 Using Named Groups

In all of our URLconf examples so far, we’ve used simple, non-named regular expression groups – that is, we put

parentheses around parts of the URL we wanted to capture, and Django passes that captured text to the view function

as a positional argument. In more advanced usage, it’s possible to use named regular expression groups to capture

URL bits and pass them as keyword arguments to a view.

10.1. URLconf Tricks 127

The Django Book, Release 2.0

Keyword Arguments vs. Positional Arguments

A Python function can be called using keyword arguments or positional arguments – and, in some cases, both at the

same time. In a keyword argument call, you specify the names of the arguments along with the values you’re passing.

In a positional argument call, you simply pass the arguments without explicitly specifying which argument matches

which value; the association is implicit in the arguments’ order.

For example, consider this simple function:

def sell(item, price, quantity):

print "Selling %s unit(s) of %s at %s"%(quantity, item, price)

To call it with positional arguments, you specify the arguments in the order in which they’re listed in the function

deﬁnition:

sell(’Socks’,’$2.50’,6)

To call it with keyword arguments, you specify the names of the arguments along with the values. The following

statements are equivalent:

sell(item=’Socks’, price=’$2.50’, quantity=6)

sell(item=’Socks’, quantity=6, price=’$2.50’)

sell(price=’$2.50’, item=’Socks’, quantity=6)

sell(price=’$2.50’, quantity=6, item=’Socks’)

sell(quantity=6, item=’Socks’, price=’$2.50’)

sell(quantity=6, price=’$2.50’, item=’Socks’)

Finally, you can mix keyword and positional arguments, as long as all positional arguments are listed before keyword

arguments. The following statements are equivalent to the previous examples:

sell(’Socks’,’$2.50’, quantity=6)

sell(’Socks’, price=’$2.50’, quantity=6)

sell(’Socks’, quantity=6, price=’$2.50’)

In Python regular expressions, the syntax for named regular expression groups is (?P<name>pattern), where

name is the name of the group and pattern is some pattern to match.

Here’s an example URLconf that uses non-named groups:

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^articles/(\d{4})/$’, views.year_archive),

(r’^articles/(\d{4})/(\d{2})/$’, views.month_archive),

)

Here’s the same URLconf, rewritten to use named groups:

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^articles/(?P<year>\d{4})/$’, views.year_archive),

(r’^articles/(?P<year>\d{4})/(?P<month>\d{2})/$’, views.month_archive),

)

This accomplishes exactly the same thing as the previous example, with one subtle difference: the captured values are

passed to view functions as keyword arguments rather than positional arguments.

128 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

For example, with non-named groups, a request to /articles/2006/03/ would result in a function call equivalent

to this:

month_archive(request, ’2006’,’03’)

With named groups, though, the same request would result in this function call:

month_archive(request, year=’2006’, month=’03’)

In practice, using named groups makes your URLconfs slightly more explicit and less prone to argument-order bugs

– and you can reorder the arguments in your views’ function deﬁnitions. Following the preceding example, if we

wanted to change the URLs to include the month before the year, and we were using non-named groups, we’d have to

remember to change the order of arguments in the month_archive view. If we were using named groups, changing

the order of the captured parameters in the URL would have no effect on the view.

Of course, the beneﬁts of named groups come at the cost of brevity; some developers ﬁnd the named-group syntax

ugly and too verbose. Still, another advantage of named groups is readability, especially by those who aren’t intimately

familiar with regular expressions or your particular Django application. It’s easier to see what’s happening, at a glance,

in a URLconf that uses named groups.

10.1.5 Understanding the Matching/Grouping Algorithm

A caveat with using named groups in a URLconf is that a single URLconf pattern cannot contain both named and

non-named groups. If you do this, Django won’t throw any errors, but you’ll probably ﬁnd that your URLs aren’t

matching as you expect. Speciﬁcally, here’s the algorithm the URLconf parser follows, with respect to named groups

vs. non-named groups in a regular expression:

• If there are any named arguments, it will use those, ignoring non-named arguments.

• Otherwise, it will pass all non-named arguments as positional arguments.

• In both cases, it will pass any extra options as keyword arguments. See the next section for more information.

10.1.6 Passing Extra Options to View Functions

Sometimes you’ll ﬁnd yourself writing view functions that are quite similar, with only a few small differences. For

example, say you have two views whose contents are identical except for the template they use:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^foo/$’, views.foo_view),

(r’^bar/$’, views.bar_view),

)

# views.py

from django.shortcuts import render

from mysite.models import MyModel

def foo_view(request):

m_list =MyModel.objects.filter(is_new=True)

return render(request, ’template1.html’, {’m_list’: m_list})

10.1. URLconf Tricks 129

The Django Book, Release 2.0

def bar_view(request):

m_list =MyModel.objects.filter(is_new=True)

return render(request, ’template2.html’, {’m_list’: m_list})

We’re repeating ourselves in this code, and that’s inelegant. At ﬁrst, you may think to remove the redundancy by using

the same view for both URLs, putting parentheses around the URL to capture it, and checking the URL within the

view to determine the template, like so:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^(foo)/$’, views.foobar_view),

(r’^(bar)/$’, views.foobar_view),

)

# views.py

from django.shortcuts import render

from mysite.models import MyModel

def foobar_view(request, url):

m_list =MyModel.objects.filter(is_new=True)

if url == ’foo’:

template_name =’template1.html’

elif url == ’bar’:

template_name =’template2.html’

return render(request, template_name, {’m_list’: m_list})

The problem with that solution, though, is that it couples your URLs to your code. If you decide to rename /foo/ to

/fooey/, you’ll have to remember to change the view code.

The elegant solution involves an optional URLconf parameter. Each pattern in a URLconf may include a third item: a

dictionary of keyword arguments to pass to the view function.

With this in mind, we can rewrite our ongoing example like this:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^foo/$’, views.foobar_view, {’template_name’:’template1.html’}),

(r’^bar/$’, views.foobar_view, {’template_name’:’template2.html’}),

)

# views.py

from django.shortcuts import render

from mysite.models import MyModel

def foobar_view(request, template_name):

m_list =MyModel.objects.filter(is_new=True)

return render(request, template_name, {’m_list’: m_list})

As you can see, the URLconf in this example speciﬁes template_name in the URLconf. The view function treats

130 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

it as just another parameter.

This extra URLconf options technique is a nice way of sending additional information to your view functions with

minimal fuss. As such, it’s used by a couple of Django’s bundled applications, most notably its generic views system,

which we cover in Chapter 11.

The following sections contain a couple of ideas on how you can use the extra URLconf options technique in your

own projects.

Faking Captured URLconf Values

Say you have a set of views that match a pattern, along with another URL that doesn’t ﬁt the pattern but whose view

logic is the same. In this case, you can “fake” the capturing of URL values by using extra URLconf options to handle

that extra URL with the same view.

For example, you might have an application that displays some data for a particular day, with URLs such as these:

/mydata/jan/01/

/mydata/jan/02/

/mydata/jan/03/

# ...

/mydata/dec/30/

/mydata/dec/31/

This is simple enough to deal with – you can capture those in a URLconf like this (using named group syntax):

urlpatterns =patterns(’’,

(r’^mydata/(?P<month>\w{3})/(?P<day>\d\d)/$’, views.my_view),

)

And the view function signature would look like this:

def my_view(request, month, day):

# ....

This approach is straightforward – it’s nothing you haven’t seen before. The trick comes in when you want to add

another URL that uses my_view but whose URL doesn’t include a month and/or day.

For example, you might want to add another URL, /mydata/birthday/, which would be equivalent to

/mydata/jan/06/. You can take advantage of extra URLconf options like so:

urlpatterns =patterns(’’,

(r’^mydata/birthday/$’, views.my_view, {’month’:’jan’,’day’:’06’}),

(r’^mydata/(?P<month>\w{3})/(?P<day>\d\d)/$’, views.my_view),

)

The cool thing here is that you don’t have to change your view function at all. The view function only cares that it gets

month and day parameters – it doesn’t matter whether they come from the URL capturing itself or extra parameters.

Making a View Generic

It’s good programming practice to “factor out” commonalities in code. For example, with these two Python functions:

def say_hello(person_name):

print ’Hello, %s’%person_name

def say_goodbye(person_name):

print ’Goodbye, %s’%person_name

10.1. URLconf Tricks 131

The Django Book, Release 2.0

we can factor out the greeting to make it a parameter:

def greet(person_name, greeting):

print ’%s,%s’%(greeting, person_name)

You can apply this same philosophy to your Django views by using extra URLconf parameters.

With this in mind, you can start making higher-level abstractions of your views. Instead of thinking to yourself, “This

view displays a list of Event objects,” and “That view displays a list of BlogEntry objects,” realize they’re both

speciﬁc cases of “A view that displays a list of objects, where the type of object is variable.”

Take this code, for example:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^events/$’, views.event_list),

(r’^blog/entries/$’, views.entry_list),

)

# views.py

from django.shortcuts import render

from mysite.models import Event, BlogEntry

def event_list(request):

obj_list =Event.objects.all()

return render(request, ’mysite/event_list.html’, {’event_list’: obj_list})

def entry_list(request):

obj_list =BlogEntry.objects.all()

return render(request, ’mysite/blogentry_list.html’, {’entry_list’: obj_list})

The two views do essentially the same thing: they display a list of objects. So let’s factor out the type of object they’re

displaying:

# urls.py

from django.conf.urls.defaults import *

from mysite import models, views

urlpatterns =patterns(’’,

(r’^events/$’, views.object_list, {’model’: models.Event}),

(r’^blog/entries/$’, views.object_list, {’model’: models.BlogEntry}),

)

# views.py

from django.shortcuts import render

def object_list(request, model):

obj_list =model.objects.all()

template_name =’mysite/%s_list.html’ %model.__name__.lower()

return render(request, template_name, {’object_list’: obj_list})

With those small changes, we suddenly have a reusable, model-agnostic view! From now on, anytime we need a view

that lists a set of objects, we can simply reuse this object_list view rather than writing view code. Here are a

132 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

couple of notes about what we did:

• We’re passing the model classes directly, as the model parameter. The dictionary of extra URLconf options

can pass any type of Python object – not just strings.

• The model.objects.all() line is an example of duck typing: “If it walks like a duck and talks like a duck,

we can treat it like a duck.” Note the code doesn’t know what type of object model is; the only requirement is

that model have an objects attribute, which in turn has an all() method.

• We’re using model.__name__.lower() in determining the template name. Every Python class has a

__name__ attribute that returns the class name. This feature is useful at times like this, when we don’t know

the type of class until runtime. For example, the BlogEntry class’s __name__ is the string ’BlogEntry’.

• In a slight difference between this example and the previous example, we’re passing the generic variable name

object_list to the template. We could easily change this variable name to be blogentry_list or

event_list, but we’ve left that as an exercise for the reader.

Because database-driven Web sites have several common patterns, Django comes with a set of “generic views” that

use this exact technique to save you time. We cover Django’s built-in generic views in Chapter 11.

Giving a View Conﬁguration Options

If you’re distributing a Django application, chances are that your users will want some degree of conﬁguration. In this

case, it’s a good idea to add hooks to your views for any conﬁguration options you think people may want to change.

You can use extra URLconf parameters for this purpose.

A common bit of an application to make conﬁgurable is the template name:

def my_view(request, template_name):

var =do_something()

return render(request, template_name, {’var’: var})

Understanding Precedence of Captured Values vs. Extra Options

When there’s a conﬂict, extra URLconf parameters get precedence over captured parameters. In other words, if your

URLconf captures a named-group variable and an extra URLconf parameter includes a variable with the same name,

the extra URLconf parameter value will be used.

For example, consider this URLconf:

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^mydata/(?P<id>\d+)/$’, views.my_view, {’id’:3}),

)

Here, both the regular expression and the extra dictionary include an id. The hard-coded id gets precedence. That

means any request (e.g., /mydata/2/ or /mydata/432432/) will be treated as if id is set to 3, regardless of the

value captured in the URL.

Astute readers will note that in this case, it’s a waste of time and typing to capture the id in the regular expression,

because its value will always be overridden by the dictionary’s value. That’s correct; we bring this up only to help you

avoid making the mistake.

10.1. URLconf Tricks 133

The Django Book, Release 2.0

10.1.7 Using Default View Arguments

Another convenient trick is to specify default parameters for a view’s arguments. This tells the view which value to

use for a parameter by default if none is speciﬁed.

An example:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns = patterns(’’,

(r’^blog/$’, views.page),

(r’^blog/page(?P<num>\d+)/$’, views.page),

)

# views.py

def page(request, num=’1’):

# Output the appropriate page of blog entries, according to num.

# ...

Here, both URL patterns point to the same view – views.page – but the ﬁrst pattern doesn’t capture anything from

the URL. If the ﬁrst pattern matches, the page() function will use its default argument for num,’1’. If the second

pattern matches, page() will use whatever num value was captured by the regular expression.

(Note that we’ve been careful to set the default argument’s value to the string ’1’, not the integer 1. That’s for

consistency, because any captured value for num will always be a string.)

It’s common to use this technique in conjunction with conﬁguration options, as explained earlier. This example makes

a slight improvement to the example in the “Giving a View Conﬁguration Options” section by providing a default

value for template_name:

def my_view(request, template_name=’mysite/my_view.html’):

var =do_something()

return render(request, template_name, {’var’: var})

10.1.8 Special-Casing Views

Sometimes you’ll have a pattern in your URLconf that handles a large set of URLs, but you’ll need to special-case one

of them. In this case, take advantage of the linear way a URLconf is processed and put the special case ﬁrst.

For example, you can think of the “add an object” pages in Django’s admin site as represented by a URLpattern like

this:

urlpatterns =patterns(’’,

# ...

(’^([^/]+)/([^/]+)/add/$’, views.add_stage),

# ...

)

This matches URLs such as /myblog/entries/add/ and /auth/groups/add/. However, the “add” page

for a user object (/auth/user/add/) is a special case – it doesn’t display all of the form ﬁelds, it displays two

password ﬁelds, and so forth. We could solve this problem by special-casing in the view, like so:

def add_stage(request, app_label, model_name):

if app_label == ’auth’ and model_name == ’user’:

# do special-case code

134 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

else:

# do normal code

but that’s inelegant for a reason we’ve touched on multiple times in this chapter: it puts URL logic in the view. As a

more elegant solution, we can take advantage of the fact that URLconfs are processed in order from top to bottom:

urlpatterns =patterns(’’,

# ...

(’^auth/user/add/$’, views.user_add_stage),

(’^([^/]+)/([^/]+)/add/$’, views.add_stage),

# ...

)

With this in place, a request to /auth/user/add/ will be handled by the user_add_stage view. Although that

URL matches the second pattern, it matches the top one ﬁrst. (This is short-circuit logic.)

10.1.9 Capturing Text in URLs

Each captured argument is sent to the view as a plain Python Unicode string, regardless of what sort of match the

regular expression makes. For example, in this URLconf line:

(r’^articles/(?P<year>\d{4})/$’, views.year_archive),

the year argument to views.year_archive() will be a string, not an integer, even though \d{4} will only

match integer strings.

This is important to keep in mind when you’re writing view code. Many built-in Python functions are fussy (and right-

fully so) about accepting only objects of a certain type. A common error is to attempt to create a datetime.date

object with string values instead of integer values:

>>> import datetime

>>> datetime.date(’1993’,’7’,’9’)

Traceback (most recent call last):

...

TypeError: an integer is required

>>> datetime.date(1993,7,9)

datetime.date(1993, 7, 9)

Translated to a URLconf and view, the error looks like this:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

(r’^articles/(\d{4})/(\d{2})/(\d{2})/$’, views.day_archive),

)

# views.py

import datetime

def day_archive(request, year, month, day):

# The following statement raises a TypeError!

date =datetime.date(year, month, day)

Instead, day_archive() can be written correctly like this:

10.1. URLconf Tricks 135

The Django Book, Release 2.0

def day_archive(request, year, month, day):

date =datetime.date(int(year), int(month), int(day))

Note that int() itself raises a ValueError when you pass it a string that is not composed solely of digits, but we’re

avoiding that error in this case because the regular expression in our URLconf has ensured that only strings containing

digits are passed to the view function.

10.1.10 Determining What the URLconf Searches Against

When a request comes in, Django tries to match the URLconf patterns against the requested URL, as a Python string.

This does not include GET or POST parameters, or the domain name. It also does not include the leading slash,

because every URL has a leading slash.

For example, in a request to http://www.example.com/myapp/, Django will try to match myapp/. In a

request to http://www.example.com/myapp/?page=3, Django will try to match myapp/.

The request method (e.g., POST,GET) is not taken into account when traversing the URLconf. In other words, all

request methods will be routed to the same function for the same URL. It’s the responsibility of a view function to

perform branching based on request method.

10.1.11 Higher-Level Abstractions of View Functions

And speaking of branching based on request method, let’s take a look at how we might build a nice way of doing that.

Consider this URLconf/view layout:

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

# ...

(r’^somepage/$’, views.some_page),

# ...

)

# views.py

from django.http import Http404, HttpResponseRedirect

from django.shortcuts import render

def some_page(request):

if request.method == ’POST’:

do_something_for_post()

return HttpResponseRedirect(’/someurl/’)

elif request.method == ’GET’:

do_something_for_get()

return render(request, ’page.html’)

else:

raise Http404()

In this example, the some_page() view’s handling of POST vs. GET requests is quite different. The only thing they

have in common is a shared URL: /somepage/. As such, it’s kind of inelegant to deal with both POST and GET in

the same view function. It would be nice if we could have two separate view functions – one handling GET requests

and the other handling POST – and ensuring each one was only called when appropriate.

136 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

We can do that by writing a view function that delegates to other views, either before or after executing some custom

logic. Here’s an example of how this technique could help simplify our some_page() view:

# views.py

from django.http import Http404, HttpResponseRedirect

from django.shortcuts import render

def method_splitter(request, GET=None, POST=None):

if request.method == ’GET’ and GET is not None:

return GET(request)

elif request.method == ’POST’ and POST is not None:

return POST(request)

raise Http404

def some_page_get(request):

assert request.method == ’GET’

do_something_for_get()

return render(request, ’page.html’)

def some_page_post(request):

assert request.method == ’POST’

do_something_for_post()

return HttpResponseRedirect(’/someurl/’)

# urls.py

from django.conf.urls.defaults import *

from mysite import views

urlpatterns =patterns(’’,

# ...

(r’^somepage/$’, views.method_splitter, {’GET’: views.some_page_get, ’POST’: views.some_page_post}),

# ...

)

Let’s go through what this does:

• We’ve written a new view, method_splitter(), that delegates to other views based on

request.method. It looks for two keyword arguments, GET and POST, which should be view functions.

If request.method is ’GET’, then it calls the GET view. If request.method is ’POST’, then it calls

the POST view. If request.method is something else (HEAD, etc.), or if GET or POST were not supplied to

the function, then it raises an Http404.

• In the URLconf, we point /somepage/ at method_splitter() and pass it extra arguments – the view

functions to use for GET and POST, respectively.

• Finally, we’ve split the some_page() view into two view functions – some_page_get() and

some_page_post(). This is much nicer than shoving all of that logic into a single view.

Note that these view functions technically no longer have to check request.method, because

method_splitter() does that. (By the time some_page_post() is called, for example, we can be

conﬁdent request.method is ’post’.) Still, just to be safe, and also to serve as documentation, we stuck

in an assert that makes sure request.method is what we expect it to be.

Now we have ourselves a nice, generic view function that encapsulates the logic of delegating a view by

request.method. Nothing about method_splitter() is tied to our speciﬁc application, of course, so we

can reuse it in other projects.

But, while we’re at it, there’s one way to improve on method_splitter(). As it’s written, it assumes that the

10.1. URLconf Tricks 137

The Django Book, Release 2.0

GET and POST views take no arguments other than request. What if we wanted to use method_splitter()

with views that, for example, capture text from URLs, or take optional keyword arguments themselves?

To do that, we can use a nice Python feature: variable arguments with asterisks. We’ll show the example ﬁrst, then

explain it:

def method_splitter(request, *args, **kwargs):

get_view =kwargs.pop(’GET’,None)

post_view =kwargs.pop(’POST’,None)

if request.method == ’GET’ and get_view is not None:

return get_view(request, *args, **kwargs)

elif request.method == ’POST’ and post_view is not None:

return post_view(request, *args, **kwargs)

raise Http404

Here, we’ve refactored method_splitter() to remove the GET and POST keyword arguments, in favor of *args

and **kwargs (note the asterisks). This is a Python feature that allows a function to accept a dynamic, arbitrary

number of arguments whose names aren’t known until runtime. If you put a single asterisk in front of a parameter in

a function deﬁnition, any positional arguments to that function will be rolled up into a single tuple. If you put two

asterisks in front of a parameter in a function deﬁnition, any keyword arguments to that function will be rolled up into

a single dictionary.

For example, with this function:

def foo(*args, **kwargs):

print "Positional arguments are:"

print args

print "Keyword arguments are:"

print kwargs

Here’s how it would work:

>>> foo(1,2,3)

Positional arguments are:

(1, 2, 3)

Keyword arguments are:

{}

>>> foo(1,2, name=’Adrian’, framework=’Django’)

Positional arguments are:

(1, 2)

Keyword arguments are:

{’framework’: ’Django’, ’name’: ’Adrian’}

Bringing this back to method_splitter(), you can see we’re using *args and **kwargs to accept any ar-

guments to the function and pass them along to the appropriate view. But before we do that, we make two calls to

kwargs.pop() to get the GET and POST arguments, if they’re available. (We’re using pop() with a default value

of None to avoid KeyError if one or the other isn’t deﬁned.)

10.1.12 Wrapping View Functions

Our ﬁnal view trick takes advantage of an advanced Python technique. Say you ﬁnd yourself repeating a bunch of

code throughout various views, as in this example:

def my_view1(request):

if not request.user.is_authenticated():

return HttpResponseRedirect(’/accounts/login/’)

# ...

return render(request, ’template1.html’)

138 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

def my_view2(request):

if not request.user.is_authenticated():

return HttpResponseRedirect(’/accounts/login/’)

# ...

return render(request, ’template2.html’)

def my_view3(request):

if not request.user.is_authenticated():

return HttpResponseRedirect(’/accounts/login/’)

# ...

return render(request, ’template3.html’)

Here, each view starts by checking that request.user is authenticated – that is, the current user has success-

fully logged into the site – and redirects to /accounts/login/ if not. (Note that we haven’t yet covered

request.user – Chapter 14 does – but, as you might imagine, request.user represents the current user,

either logged-in or anonymous.)

It would be nice if we could remove that bit of repetitive code from each of these views and just mark them as requiring

authentication. We can do that by making a view wrapper. Take a moment to study this:

def requires_login(view):

def new_view(request, *args, **kwargs):

if not request.user.is_authenticated():

return HttpResponseRedirect(’/accounts/login/’)

return view(request, *args, **kwargs)

return new_view

This function, requires_login, takes a view function (view) and returns a new view function (new_view).

The new function, new_view is deﬁned within requires_login and handles the logic of checking

request.user.is_authenticated() and delegating to the original view (view).

Now, we can remove the if not request.user.is_authenticated() checks from our views and simply

wrap them with requires_login in our URLconf:

from django.conf.urls.defaults import *

from mysite.views import requires_login, my_view1, my_view2, my_view3

urlpatterns =patterns(’’,

(r’^view1/$’, requires_login(my_view1)),

(r’^view2/$’, requires_login(my_view2)),

(r’^view3/$’, requires_login(my_view3)),

)

This has the same effect as before, but with less code redundancy. Now we’ve created a nice, generic function –

requires_login() that we can wrap around any view in order to make it require login.

10.2 Including Other URLconfs

If you intend your code to be used on multiple Django-based sites, you should consider arranging your URLconfs in

such a way that allows for “including.”

At any point, your URLconf can “include” other URLconf modules. This essentially “roots” a set of URLs below

other ones. For example, this URLconf includes other URLconfs:

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

10.2. Including Other URLconfs 139

The Django Book, Release 2.0

(r’^weblog/’, include(’mysite.blog.urls’)),

(r’^photos/’, include(’mysite.photos.urls’)),

(r’^about/$’,’mysite.views.about’),

)

(We saw this before in Chapter 6, when we introduced the Django admin site. The admin site has its own URLconf

that you merely include() within yours.)

There’s an important gotcha here: the regular expressions in this example that point to an include() do not have a

$(end-of-string match character) but do include a trailing slash. Whenever Django encounters include(), it chops

off whatever part of the URL matched up to that point and sends the remaining string to the included URLconf for

further processing.

Continuing this example, here’s the URLconf mysite.blog.urls:

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^(\d\d\d\d)/$’,’mysite.blog.views.year_detail’),

(r’^(\d\d\d\d)/(\d\d)/$’,’mysite.blog.views.month_detail’),

)

With these two URLconfs, here’s how a few sample requests would be handled:

•/weblog/2007/: In the ﬁrst URLconf, the pattern r’^weblog/’ matches. Because it is an include(),

Django strips all the matching text, which is ’weblog/’ in this case. The remaining part of the URL is

2007/, which matches the ﬁrst line in the mysite.blog.urls URLconf.

•/weblog//2007/ (with two slashes): In the ﬁrst URLconf, the pattern r’^weblog/’ matches. Because it

is an include(), Django strips all the matching text, which is ’weblog/’ in this case. The remaining part of

the URL is /2007/ (with a leading slash), which does not match any of the lines in the mysite.blog.urls

URLconf.

•/about/: This matches the view mysite.views.about in the ﬁrst URLconf, demonstrating that you can

mix include() patterns with non-include() patterns.

10.2.1 How Captured Parameters Work with include()

An included URLconf receives any captured parameters from parent URLconfs, for example:

# root urls.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^(?P<username>\w+)/blog/’, include(’foo.urls.blog’)),

)

# foo/urls/blog.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^$’,’foo.views.blog_index’),

(r’^archive/$’,’foo.views.blog_archive’),

)

In this example, the captured username variable is passed to the included URLconf and, hence, to every view

function within that URLconf.

140 Chapter 10. Chapter 8: Advanced Views and URLconfs

The Django Book, Release 2.0

Note that the captured parameters will always be passed to every line in the included URLconf, regardless of whether

the line’s view actually accepts those parameters as valid. For this reason, this technique is useful only if you’re certain

that every view in the included URLconf accepts the parameters you’re passing.

10.2.2 How Extra URLconf Options Work with include()

Similarly, you can pass extra URLconf options to include(), just as you can pass extra URLconf options to a

normal view – as a dictionary. When you do this, each line in the included URLconf will be passed the extra options.

For example, the following two URLconf sets are functionally identical.

Set one:

# urls.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^blog/’, include(’inner’), {’blogid’:3}),

)

# inner.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^archive/$’,’mysite.views.archive’),

(r’^about/$’,’mysite.views.about’),

(r’^rss/$’,’mysite.views.rss’),

)

Set two:

# urls.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^blog/’, include(’inner’)),

)

# inner.py

from django.conf.urls.defaults import *

urlpatterns =patterns(’’,

(r’^archive/$’,’mysite.views.archive’, {’blogid’:3}),

(r’^about/$’,’mysite.views.about’, {’blogid’:3}),

(r’^rss/$’,’mysite.views.rss’, {’blogid’:3}),

)

As is the case with captured parameters (explained in the previous section), extra options will always be passed to

every line in the included URLconf, regardless of whether the line’s view actually accepts those options as valid. For

this reason, this technique is useful only if you’re certain that every view in the included URLconf accepts the extra

options you’re passing.

10.2. Including Other URLconfs 141

The Django Book, Release 2.0

10.3 What’s Next?

This chapter has provided many advanced tips and tricks for views and URLconfs. Next, in Chapter 9, we’ll give this

advanced treatment to Django’s template system.

142 Chapter 10. Chapter 8: Advanced Views and URLconfs

CHAPTER 11

Chapter 9: Advanced Templates

Although most of your interactions with Django’s template language will be in the role of template author, you may

want to customize and extend the template engine – either to make it do something it doesn’t already do, or to make

your job easier in some other way.

This chapter delves deep into the guts of Django’s template system. It covers what you need to know if you plan to

extend the system or if you’re just curious about how it works. It also covers the auto-escaping feature, a security

measure you’ll no doubt notice over time as you continue to use Django.

If you’re looking to use the Django template system as part of another application (i.e., without the rest of the frame-

work), make sure to read the “Conﬁguring the Template System in Standalone Mode” section later in the chapter.

11.1 Template Language Review

First, let’s quickly review a number of terms introduced in Chapter 4:

• A template is a text document, or a normal Python string, that is marked up using the Django template language.

A template can contain template tags and variables.

• A template tag is a symbol within a template that does something. This deﬁnition is deliberately vague. For

example, a template tag can produce content, serve as a control structure (an if statement or for loop), grab

content from a database, or enable access to other template tags.

Template tags are surrounded by {% and %}:

{% if is_logged_in %}

Thanks for logging in!

{% else %}

Please log in.

{% endif %}

• A variable is a symbol within a template that outputs a value.

Variable tags are surrounded by {{ and }}:

My first name is {{ first_name }}. My last name is {{ last_name }}.

• A context is a name -> value mapping (similar to a Python dictionary) that is passed to a template.

• A template renders a context by replacing the variable “holes” with values from the context and executing all

template tags.

For more details about the basics of these terms, refer back to Chapter 4.

143

The Django Book, Release 2.0

The rest of this chapter discusses ways of extending the template engine. First, though, let’s take a quick look at a few

internals left out of Chapter 4 for simplicity.

11.2 RequestContext and Context Processors

When rendering a template, you need a context. This can be an instance of django.template.Context,

but Django also comes with a subclass, django.template.RequestContext, that acts slightly differently.

RequestContext adds a bunch of variables to your template context by default – things like the HttpRequest

object or information about the currently logged-in user. The render() shortcut creates a RequestContext

unless it is passed a different context instance explicitly.

Use RequestContext when you don’t want to have to specify the same set of variables in a series of templates.

For example, consider these two views:

from django.template import loader, Context

def view_1(request):

# ...

t=loader.get_template(’template1.html’)

c=Context({

’app’:’My app’,

’user’: request.user,

’ip_address’: request.META[’REMOTE_ADDR’],

’message’:’I am view 1.’

})

return t.render(c)

def view_2(request):

# ...

t=loader.get_template(’template2.html’)

c=Context({

’app’:’My app’,

’user’: request.user,

’ip_address’: request.META[’REMOTE_ADDR’],

’message’:’I am the second view.’

})

return t.render(c)

(Note that we’re deliberately not using the render() shortcut in these examples – we’re manually loading the

templates, constructing the context objects and rendering the templates. We’re “spelling out” all of the steps for the

purpose of clarity.)

Each view passes the same three variables – app,user and ip_address – to its template. Wouldn’t it be nice if

we could remove that redundancy?

RequestContext and context processors were created to solve this problem. Context processors let you specify

a number of variables that get set in each context automatically – without you having to specify the variables in each

render() call. The catch is that you have to use RequestContext instead of Context when you render a

template.

The most low-level way of using context processors is to create some processors and pass them to

RequestContext. Here’s how the above example could be written with context processors:

from django.template import loader, RequestContext

def custom_proc(request):

"A context processor that provides ’app’, ’user’ and ’ip_address’."

144 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

return {

’app’:’My app’,

’user’: request.user,

’ip_address’: request.META[’REMOTE_ADDR’]

}

def view_1(request):

# ...

t=loader.get_template(’template1.html’)

c=RequestContext(request, {’message’:’I am view 1.’},

processors=[custom_proc])

return t.render(c)

def view_2(request):

# ...

t=loader.get_template(’template2.html’)

c=RequestContext(request, {’message’:’I am the second view.’},

processors=[custom_proc])

return t.render(c)

Let’s step through this code:

• First, we deﬁne a function custom_proc. This is a context processor – it takes an HttpRequest object and

returns a dictionary of variables to use in the template context. That’s all it does.

• We’ve changed the two view functions to use RequestContext instead of Context. There are two dif-

ferences in how the context is constructed. One, RequestContext requires the ﬁrst argument to be an

HttpRequest object – the one that was passed into the view function in the ﬁrst place (request). Two,

RequestContext takes an optional processors argument, which is a list or tuple of context processor

functions to use. Here, we pass in custom_proc, the custom processor we deﬁned above.

• Each view no longer has to include app,user or ip_address in its context construction, because those are

provided by custom_proc.

• Each view still has the ﬂexibility to introduce any custom template variables it might need. In this example, the

message template variable is set differently in each view.

In Chapter 4, we introduced the render() shortcut, which saves you from having to call

loader.get_template(), then create a Context, then call the render() method on the template. In

order to demonstrate the lower-level workings of context processors, the above examples didn’t use render(), . But

it’s possible – and preferable – to use context processors with render(). Do this with the context_instance

argument, like so:

from django.shortcuts import render

from django.template import RequestContext

def custom_proc(request):

"A context processor that provides ’app’, ’user’ and ’ip_address’."

return {

’app’:’My app’,

’user’: request.user,

’ip_address’: request.META[’REMOTE_ADDR’]

}

def view_1(request):

# ...

return render(request, ’template1.html’,

{’message’:’I am view 1.’},

context_instance=RequestContext(request, processors=[custom_proc]))

11.2. RequestContext and Context Processors 145

The Django Book, Release 2.0

def view_2(request):

# ...

return render(request, ’template2.html’,

{’message’:’I am the second view.’},

context_instance=RequestContext(request, processors=[custom_proc]))

Here, we’ve trimmed down each view’s template rendering code to a single (wrapped) line.

This is an improvement, but, evaluating the conciseness of this code, we have to admit we’re now almost overdosing

on the other end of the spectrum. We’ve removed redundancy in data (our template variables) at the cost of adding

redundancy in code (in the processors call). Using context processors doesn’t save you much typing if you have

to type processors all the time.

For that reason, Django provides support for global context processors. The TEMPLATE_CONTEXT_PROCESSORS

setting (in your settings.py) designates which context processors should always be applied to

RequestContext. This removes the need to specify processors each time you use RequestContext.

By default, TEMPLATE_CONTEXT_PROCESSORS is set to the following:

TEMPLATE_CONTEXT_PROCESSORS =(

’django.core.context_processors.auth’,

’django.core.context_processors.debug’,

’django.core.context_processors.i18n’,

’django.core.context_processors.media’,

)

This setting is a tuple of callables that use the same interface as our custom_proc function above – functions that

take a request object as their argument and return a dictionary of items to be merged into the context. Note that the

values in TEMPLATE_CONTEXT_PROCESSORS are speciﬁed as strings, which means the processors are required to

be somewhere on your Python path (so you can refer to them from the setting).

Each processor is applied in order. That is, if one processor adds a variable to the context and a second processor adds

a variable with the same name, the second will override the ﬁrst.

Django provides a number of simple context processors, including the ones that are enabled by default:

11.2.1 django.core.context_processors.auth

If TEMPLATE_CONTEXT_PROCESSORS contains this processor, every RequestContext will contain these vari-

ables:

•user: A django.contrib.auth.models.User instance representing the current logged-in user (or an

AnonymousUser instance, if the client isn’t logged in).

•messages: A list of messages (as strings) for the current logged-in user. Behind the scenes, this variable

calls request.user.get_and_delete_messages() for every request. That method collects the user’s

messages and deletes them from the database.

•perms: An instance of django.core.context_processors.PermWrapper, which represents the

permissions the current logged-in user has.

See Chapter 14 for more information on users, permissions, and messages.

11.2.2 django.core.context_processors.debug

This processor pushes debugging information down to the template layer. If TEMPLATE_CONTEXT_PROCESSORS

contains this processor, every RequestContext will contain these variables:

146 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

•debug: The value of your DEBUG setting (either True or False). You can use this variable in templates to

test whether you’re in debug mode.

•sql_queries: A list of {’sql’: ..., ’time’: ...} dictionaries representing every SQL query

that has happened so far during the request and how long it took. The list is in the order in which the queries

were issued.

Because debugging information is sensitive, this context processor will only add variables to the context if both of the

following conditions are true:

• The DEBUG setting is True.

• The request came from an IP address in the INTERNAL_IPS setting.

Astute readers will notice that the debug template variable will never have the value False because, if DEBUG is

False, then the debug template variable won’t be populated in the ﬁrst place.

11.2.3 django.core.context_processors.i18n

If this processor is enabled, every RequestContext will contain these variables:

•LANGUAGES: The value of the LANGUAGES setting.

•LANGUAGE_CODE:request.LANGUAGE_CODE if it exists; otherwise, the value of the LANGUAGE_CODE

setting.

Appendix D provides more information about these two settings.

11.2.4 django.core.context_processors.request

If this processor is enabled, every RequestContext will contain a variable request, which is the current

HttpRequest object. Note that this processor is not enabled by default; you have to activate it.

You might want to use this if you ﬁnd your templates needing to access attributes of the current HttpRequest such

as the IP address:

11.2.5 Guidelines for Writing Your Own Context Processors

Here are a few tips for rolling your own:

• Make each context processor responsible for the smallest subset of functionality possible. It’s easy to use

multiple processors, so you might as well split functionality into logical pieces for future reuse.

• Keep in mind that any context processor in TEMPLATE_CONTEXT_PROCESSORS will be available in every

template powered by that settings ﬁle, so try to pick variable names that are unlikely to conﬂict with variable

names your templates might be using independently. As variable names are case-sensitive, it’s not a bad idea to

use all caps for variables that a processor provides.

• It doesn’t matter where on the ﬁlesystem they live, as long as they’re on your Python path so you can point to

them from the TEMPLATE_CONTEXT_PROCESSORS setting. With that said, the convention is to save them in

a ﬁle called context_processors.py within your app or project.

11.2. RequestContext and Context Processors 147

The Django Book, Release 2.0

11.3 Automatic HTML Escaping

When generating HTML from templates, there’s always a risk that a variable will include characters that affect the

resulting HTML. For example, consider this template fragment:

Hello, {{ name }}.

At ﬁrst, this seems like a harmless way to display a user’s name, but consider what would happen if the user entered

his name as this:

With this name value, the template would be rendered as:

Hello, <script>alert(’hello’)</script>

...which means the browser would pop-up a JavaScript alert box!

Similarly, what if the name contained a ’<’ symbol, like this?

<b>username

That would result in a rendered template like this:

Hello, <b>username

...which, in turn, would result in the remainder of the Web page being bolded!

Clearly, user-submitted data shouldn’t be trusted blindly and inserted directly into your Web pages, because a malicious

user could use this kind of hole to do potentially bad things. This type of security exploit is called a Cross Site Scripting

(XSS) attack. (For more on security, see Chapter 20.)

To avoid this problem, you have two options:

• One, you can make sure to run each untrusted variable through the escape ﬁlter, which converts potentially

harmful HTML characters to unharmful ones. This was the default solution in Django for its ﬁrst few years, but

the problem is that it puts the onus on you, the developer / template author, to ensure you’re escaping everything.

It’s easy to forget to escape data.

• Two, you can take advantage of Django’s automatic HTML escaping. The remainder of this section describes

how auto-escaping works.

By default in Django, every template automatically escapes the output of every variable tag. Speciﬁcally, these ﬁve

characters are escaped:

•<is converted to <

•>is converted to >

•’(single quote) is converted to '

•"(double quote) is converted to "

•&is converted to &

Again, we stress that this behavior is on by default. If you’re using Django’s template system, you’re protected.

11.3.1 How to Turn it Off

If you don’t want data to be auto-escaped, on a per-site, per-template level or per-variable level, you can turn it off in

several ways.

148 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

Why would you want to turn it off? Because sometimes, template variables contain data that you intend to be rendered

as raw HTML, in which case you don’t want their contents to be escaped. For example, you might store a blob of

trusted HTML in your database and want to embed that directly into your template. Or, you might be using Django’s

template system to produce text that is not HTML – like an e-mail message, for instance.

For Individual Variables

To disable auto-escaping for an individual variable, use the safe ﬁlter:

This will be escaped: {{ data }}

This will not be escaped: {{ data|safe }}

Think of safe as shorthand for safe from further escaping or can be safely interpreted as HTML. In this example, if

data contains ’<b>’, the output will be:

This will be escaped: <b>

This will not be escaped: <b>

For Template Blocks

To control auto-escaping for a template, wrap the template (or just a particular section of the template) in the

autoescape tag, like so:

{% autoescape off %}

Hello {{ name }}

{% endautoescape %}

The autoescape tag takes either on or off as its argument. At times, you might want to force auto-escaping when

it would otherwise be disabled. Here is an example template:

Auto-escaping is on by default. Hello {{ name }}

{% autoescape off %}

This will not be auto-escaped: {{ data }}.

Nor this: {{ other_data }}

{% autoescape on %}

Auto-escaping applies again: {{ name }}

{% endautoescape %}

The auto-escaping tag passes its effect on to templates that extend the current one as well as templates included via

the include tag, just like all block tags. For example:

# base.html

{% autoescape off %}

<h1>{% block title %}{% endblock %}</h1>

{% block content %}

{% endblock %}

{% endautoescape %}

# child.html

{% extends "base.html" %}

{% block title %}This & that{% endblock %}

{% block content %}{{ greeting }}{% endblock %}

11.3. Automatic HTML Escaping 149

The Django Book, Release 2.0

Because auto-escaping is turned off in the base template, it will also be turned off in the child template, resulting in

the following rendered HTML when the greeting variable contains the string <b>Hello!</b>:

<b>Hello!</b>

11.3.2 Notes

Generally, template authors don’t need to worry about auto-escaping very much. Developers on the Python side

(people writing views and custom ﬁlters) need to think about the cases in which data shouldn’t be escaped, and mark

data appropriately, so things work in the template.

If you’re creating a template that might be used in situations where you’re not sure whether auto-escaping is enabled,

then add an escape ﬁlter to any variable that needs escaping. When auto-escaping is on, there’s no danger of the

escape ﬁlter double-escaping data – the escape ﬁlter does not affect auto-escaped variables.

11.3.3 Automatic Escaping of String Literals in Filter Arguments

As we mentioned earlier, ﬁlter arguments can be strings:

All string literals are inserted without any automatic escaping into the template – they act as if they were all passed

through the safe ﬁlter. The reasoning behind this is that the template author is in control of what goes into the string

literal, so they can make sure the text is correctly escaped when the template is written.

This means you would write

...rather than

{{ data|default:"3 < 2" }} <-- Bad! Don’t do this.

This doesn’t affect what happens to data coming from the variable itself. The variable’s contents are still automatically

escaped, if necessary, because they’re beyond the control of the template author.

11.4 Inside Template Loading

Generally, you’ll store templates in ﬁles on your ﬁlesystem, but you can use custom template loaders to load templates

from other sources.

Django has two ways to load templates:

•django.template.loader.get_template(template_name):get_template returns the com-

piled template (a Template object) for the template with the given name. If the template doesn’t exist, a

TemplateDoesNotExist exception will be raised.

•django.template.loader.select_template(template_name_list):select_template

is just like get_template, except it takes a list of template names. Of the list, it returns the ﬁrst template that

exists. If none of the templates exist, a TemplateDoesNotExist exception will be raised.

As covered in Chapter 4, each of these functions by default uses your TEMPLATE_DIRS setting to load templates.

Internally, however, these functions actually delegate to a template loader for the heavy lifting.

150 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

Some of loaders are disabled by default, but you can activate them by editing the TEMPLATE_LOADERS setting.

TEMPLATE_LOADERS should be a tuple of strings, where each string represents a template loader. These template

loaders ship with Django:

•django.template.loaders.filesystem.load_template_source: This loader loads templates

from the ﬁlesystem, according to TEMPLATE_DIRS. It is enabled by default.

•django.template.loaders.app_directories.load_template_source: This loader loads

templates from Django applications on the ﬁlesystem. For each application in INSTALLED_APPS, the loader

looks for a templates subdirectory. If the directory exists, Django looks for templates there.

This means you can store templates with your individual applications, making it easy to distribute Django

applications with default templates. For example, if INSTALLED_APPS contains (’myproject.polls’,

’myproject.music’), then get_template(’foo.html’) will look for templates in this order:

–/path/to/myproject/polls/templates/foo.html

–/path/to/myproject/music/templates/foo.html

Note that the loader performs an optimization when it is ﬁrst imported: it caches a list of which

INSTALLED_APPS packages have a templates subdirectory.

This loader is enabled by default.

•django.template.loaders.eggs.load_template_source: This loader is just like

app_directories, except it loads templates from Python eggs rather than from the ﬁlesystem. This

loader is disabled by default; you’ll need to enable it if you’re using eggs to distribute your application. (Python

eggs are a way of compressing Python code into a single ﬁle.)

Django uses the template loaders in order according to the TEMPLATE_LOADERS setting. It uses each loader until a

loader ﬁnds a match.

11.5 Extending the Template System

Now that you understand a bit more about the internals of the template system, let’s look at how to extend the system

with custom code.

Most template customization comes in the form of custom template tags and/or ﬁlters. Although the Django template

language comes with many built-in tags and ﬁlters, you’ll probably assemble your own libraries of tags and ﬁlters that

ﬁt your own needs. Fortunately, it’s quite easy to deﬁne your own functionality.

11.5.1 Creating a Template Library

Whether you’re writing custom tags or ﬁlters, the ﬁrst thing to do is to create a template library – a small bit of

infrastructure Django can hook into.

Creating a template library is a two-step process:

• First, decide which Django application should house the template library. If you’ve created an app via

manage.py startapp, you can put it in there, or you can create another app solely for the template li-

brary. We’d recommend the latter, because your ﬁlters might be useful to you in future projects.

Whichever route you take, make sure to add the app to your INSTALLED_APPS setting. We’ll explain this

shortly.

• Second, create a templatetags directory in the appropriate Django application’s package. It should be on

the same level as models.py,views.py, and so forth. For example:

11.5. Extending the Template System 151

The Django Book, Release 2.0

books/

__init__.py

models.py

templatetags/

views.py

Create two empty ﬁles in the templatetags directory: an __init__.py ﬁle (to indicate to Python that this

is a package containing Python code) and a ﬁle that will contain your custom tag/ﬁlter deﬁnitions. The name of

the latter ﬁle is what you’ll use to load the tags later. For example, if your custom tags/ﬁlters are in a ﬁle called

poll_extras.py, you’d write the following in a template:

{% load poll_extras %}

The {% load %} tag looks at your INSTALLED_APPS setting and only allows the loading of template

libraries within installed Django applications. This is a security feature; it allows you to host Python code for

many template libraries on a single computer without enabling access to all of them for every Django installation.

If you write a template library that isn’t tied to any particular models/views, it’s valid and quite normal to have a

Django application package that contains only a templatetags package. There’s no limit on how many modules

you put in the templatetags package. Just keep in mind that a {% load %} statement will load tags/ﬁlters for

the given Python module name, not the name of the application.

Once you’ve created that Python module, you’ll just have to write a bit of Python code, depending on whether you’re

writing ﬁlters or tags.

To be a valid tag library, the module must contain a module-level variable named register that is an instance of

template.Library. This is the data structure in which all the tags and ﬁlters are registered. So, near the top of

your module, insert the following:

from django import template

Note: For a ﬁne selection of examples, read the source code for Django’s default ﬁlters and tags. They’re

in django/template/defaultfilters.py and django/template/defaulttags.py, respectively.

Some applications in django.contrib also contain template libraries.

Once you’ve created this register variable, you’ll use it to create template ﬁlters and tags.

11.5.2 Writing Custom Template Filters

Custom ﬁlters are just Python functions that take one or two arguments:

• The value of the variable (input)

• The value of the argument, which can have a default value or be left out altogether

For example, in the ﬁlter {{ var|foo:"bar" }}, the ﬁlter foo would be passed the contents of the variable var

and the argument "bar".

Filter functions should always return something. They shouldn’t raise exceptions, and they should fail silently. If

there’s an error, they should return either the original input or an empty string, whichever makes more sense.

Here’s an example ﬁlter deﬁnition:

def cut(value, arg):

"Removes all values of arg from the given string"

return value.replace(arg, ’’)

152 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

And here’s an example of how that ﬁlter would be used to cut spaces from a variable’s value:

Most ﬁlters don’t take arguments. In this case, just leave the argument out of your function:

def lower(value): # Only one argument.

"Converts a string into all lowercase"

return value.lower()

When you’ve written your ﬁlter deﬁnition, you need to register it with your Library instance, to make it available

to Django’s template language:

The Library.filter() method takes two arguments:

• The name of the ﬁlter (a string)

• The ﬁlter function itself

If you’re using Python 2.4 or above, you can use register.filter() as a decorator instead:

@register.filter(name=’cut’)

def cut(value, arg):

return value.replace(arg, ’’)

@register.filter

def lower(value):

return value.lower()

If you leave off the name argument, as in the second example, Django will use the function’s name as the ﬁlter name.

Here, then, is a complete template library example, supplying the cut ﬁlter:

from django import template

@register.filter(name=’cut’)

def cut(value, arg):

return value.replace(arg, ’’)

11.5.3 Writing Custom Template Tags

Tags are more complex than ﬁlters, because tags can do nearly anything.

Chapter 4 describes how the template system works in a two-step process: compiling and rendering. To deﬁne a

custom template tag, you need to tell Django how to manage both of these steps when it gets to your tag.

When Django compiles a template, it splits the raw template text into nodes. Each node is an instance of

django.template.Node and has a render() method. Thus, a compiled template is simply a list of Node

objects. For example, consider this template:

Hello, {{ person.name }}.

{% ifequal name.birthday today %}

Happy birthday!

{% else %}

11.5. Extending the Template System 153

The Django Book, Release 2.0

Be sure to come back on your birthday

for a splendid surprise message.

{% endifequal %}

In compiled template form, this template is represented as this list of nodes:

• Text node: "Hello, "

• Variable node: person.name

• Text node: ".\n\n"

• IfEqual node: name.birthday and today

When you call render() on a compiled template, the template calls render() on each Node in its node list, with

the given context. The results are all concatenated together to form the output of the template. Thus, to deﬁne a custom

template tag, you specify how the raw template tag is converted into a Node (the compilation function) and what the

node’s render() method does.

In the sections that follow, we cover all the steps in writing a custom tag.

11.5.4 Writing the Compilation Function

For each template tag the parser encounters, it calls a Python function with the tag contents and the parser object itself.

This function is responsible for returning a Node instance based on the contents of the tag.

For example, let’s write a template tag, {% current_time %}, that displays the current

date/time, formatted according to a parameter given in the tag, in strftime syntax (see

http://www.djangoproject.com/r/python/strftime/). It’s a good idea to decide the tag syn-

tax before anything else. In our case, let’s say the tag should be used like this:

<p>The time is {% current_time "%Y-%m-%d %I:%M %p" %}.</p>

Note: Yes, this template tag is redundant–Django’s default {% now %} tag does the same task with simpler syntax.

This template tag is presented here just for example purposes.

The parser for this function should grab the parameter and create a Node object:

from django import template

def do_current_time(parser, token):

try:

# split_contents() knows not to split quoted strings.

tag_name, format_string =token.split_contents()

except ValueError:

msg =’%r tag requires a single argument’ %token.split_contents()[0]

raise template.TemplateSyntaxError(msg)

return CurrentTimeNode(format_string[1:-1])

There’s a lot going here:

• Each template tag compilation function takes two arguments, parser and token.parser is the template

parser object. We don’t use it in this example. token is the token currently being parsed by the parser.

•token.contents is a string of the raw contents of the tag. In our example, it’s ’current_time

"%Y-%m-%d %I:%M %p"’.

154 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

• The token.split_contents() method separates the arguments on spaces while keeping quoted strings

together. Avoid using token.contents.split() (which just uses Python’s standard string-splitting se-

mantics). It’s not as robust, as it naively splits on all spaces, including those within quoted strings.

• This function is responsible for raising django.template.TemplateSyntaxError, with helpful mes-

sages, for any syntax error.

• Don’t hard-code the tag’s name in your error messages, because that couples the tag’s name to your function.

token.split_contents()[0] will always be the name of your tag – even when the tag has no arguments.

• The function returns a CurrentTimeNode (which we’ll create shortly) containing everything the node needs

to know about this tag. In this case, it just passes the argument "%Y-%m-%d %I:%M %p". The leading and

trailing quotes from the template tag are removed with format_string[1:-1].

• Template tag compilation functions must return a Node subclass; any other return value is an error.

11.5.5 Writing the Template Node

The second step in writing custom tags is to deﬁne a Node subclass that has a render() method. Continuing the

preceding example, we need to deﬁne CurrentTimeNode:

import datetime

class CurrentTimeNode(template.Node):

def __init__(self, format_string):

self.format_string =str(format_string)

def render(self, context):

now =datetime.datetime.now()

return now.strftime(self.format_string)

These two functions (__init__() and render()) map directly to the two steps in template processing (com-

pilation and rendering). Thus, the initialization function only needs to store the format string for later use, and the

render() function does the real work.

Like template ﬁlters, these rendering functions should fail silently instead of raising errors. The only time that template

tags are allowed to raise errors is at compilation time.

11.5.6 Registering the Tag

Finally, you need to register the tag with your module’s Library instance. Registering custom tags is very similar to

registering custom ﬁlters (as explained above). Just instantiate a template.Library instance and call its tag()

method. For example:

The tag() method takes two arguments:

• The name of the template tag (string).

• The compilation function.

As with ﬁlter registration, it is also possible to use register.tag as a decorator in Python 2.4 and above:

@register.tag(name="current_time")

def do_current_time(parser, token):

# ...

@register.tag

11.5. Extending the Template System 155

The Django Book, Release 2.0

def shout(parser, token):

# ...

If you leave off the name argument, as in the second example, Django will use the function’s name as the tag name.

11.5.7 Setting a Variable in the Context

The previous section’s example simply returned a value. Often it’s useful to set template variables instead of returning

values. That way, template authors can just use the variables that your template tags set.

To set a variable in the context, use dictionary assignment on the context object in the render() method. Here’s an

updated version of CurrentTimeNode that sets a template variable, current_time, instead of returning it:

class CurrentTimeNode2(template.Node):

def __init__(self, format_string):

self.format_string =str(format_string)

def render(self, context):

now =datetime.datetime.now()

context[’current_time’]=now.strftime(self.format_string)

return ’’

(We’ll leave the creation of a do_current_time2 function, plus the registration of that function to a

current_time2 template tag, as exercises for the reader.)

Note that render() returns an empty string. render() should always return a string, so if all the template tag

does is set a variable, render() should return an empty string.

Here’s how you’d use this new version of the tag:

{% current_time2 "%Y-%M-%d %I:%M %p" %}

<p>The time is {{ current_time }}.</p>

But there’s a problem with CurrentTimeNode2: the variable name current_time is hard-coded. This

means you’ll need to make sure your template doesn’t use {{ current_time }} anywhere else, because {%

current_time2 %} will blindly overwrite that variable’s value.

A cleaner solution is to make the template tag specify the name of the variable to be set, like so:

{% get_current_time "%Y-%M-%d %I:%M %p" as my_current_time %}

<p>The current time is {{ my_current_time }}.</p>

To do so, you’ll need to refactor both the compilation function and the Node class, as follows:

import re

class CurrentTimeNode3(template.Node):

def __init__(self, format_string, var_name):

self.format_string =str(format_string)

self.var_name =var_name

def render(self, context):

now =datetime.datetime.now()

context[self.var_name] =now.strftime(self.format_string)

return ’’

def do_current_time(parser, token):

# This version uses a regular expression to parse tag contents.

try:

156 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

# Splitting by None == splitting by spaces.

tag_name, arg =token.contents.split(None,1)

except ValueError:

msg =’%r tag requires arguments’ %token.contents[0]

raise template.TemplateSyntaxError(msg)

m=re.search(r’(.*?) as (\w+)’, arg)

if m:

fmt, var_name =m.groups()

else:

msg =’%r tag had invalid arguments’ %tag_name

raise template.TemplateSyntaxError(msg)

if not (fmt[0]== fmt[-1]and fmt[0]in (’"’,"’")):

msg ="%r tag’s argument should be in quotes" %tag_name

raise template.TemplateSyntaxError(msg)

return CurrentTimeNode3(fmt[1:-1], var_name)

Now do_current_time() passes the format string and the variable name to CurrentTimeNode3.

11.5.8 Parsing Until Another Template Tag

Template tags can work as blocks containing other tags (like {% if %},{% for %}, etc.). To create a template

tag like this, use parser.parse() in your compilation function.

Here’s how the standard {% comment %} tag is implemented:

def do_comment(parser, token):

nodelist =parser.parse((’endcomment’,))

parser.delete_first_token()

return CommentNode()

class CommentNode(template.Node):

def render(self, context):

return ’’

parser.parse() takes a tuple of names of template tags to parse until. It returns an instance of

django.template.NodeList, which is a list of all Node objects that the parser encountered before it encoun-

tered any of the tags named in the tuple.

So in the preceding example, nodelist is a list of all nodes between {% comment %} and {% endcomment

%}, not counting {% comment %} and {% endcomment %} themselves.

After parser.parse() is called, the parser hasn’t yet “consumed” the {% endcomment %} tag, so the code

needs to explicitly call parser.delete_first_token() to prevent that tag from being processed twice.

Then CommentNode.render() simply returns an empty string. Anything between {% comment %} and {%

endcomment %} is ignored.

11.5.9 Parsing Until Another Template Tag and Saving Contents

In the previous example, do_comment() discarded everything between {% comment %} and {% endcomment

%}. It’s also possible to do something with the code between template tags instead.

For example, here’s a custom template tag, {% upper %}, that capitalizes everything between itself and {%

endupper %}:

11.5. Extending the Template System 157

The Django Book, Release 2.0

{% upper %}

This will appear in uppercase, {{ user_name }}.

{% endupper %}

As in the previous example, we’ll use parser.parse(). This time, we pass the resulting nodelist to Node:

def do_upper(parser, token):

nodelist =parser.parse((’endupper’,))

parser.delete_first_token()

return UpperNode(nodelist)

class UpperNode(template.Node):

def __init__(self, nodelist):

self.nodelist =nodelist

def render(self, context):

output =self.nodelist.render(context)

return output.upper()

The only new concept here is self.nodelist.render(context) in UpperNode.render(). This simply

calls render() on each Node in the node list.

For more examples of complex rendering, see the source code for {% if %},{% for %},{% ifequal %}, and

{% ifchanged %}. They live in django/template/defaulttags.py.

11.5.10 Shortcut for Simple Tags

Many template tags take a single argument – a string or a template variable reference – and return a string af-

ter doing some processing based solely on the input argument and some external information. For example, the

current_time tag we wrote earlier is of this variety. We give it a format string, and it returns the time as a string.

To ease the creation of these types of tags, Django provides a helper function, simple_tag. This function, which

is a method of django.template.Library, takes a function that accepts one argument, wraps it in a render

function and the other necessary bits mentioned previously, and registers it with the template system.

Our earlier current_time function could thus be written like this:

def current_time(format_string):

try:

return datetime.datetime.now().strftime(str(format_string))

except UnicodeEncodeError:

return ’’

In Python 2.4, the decorator syntax also works:

@register.simple_tag

def current_time(token):

# ...

Notice a couple of things to notice about the simple_tag helper function:

• Only the (single) argument is passed into our function.

• Checking for the required number of arguments has already been done by the time our function is called, so we

don’t need to do that.

• The quotes around the argument (if any) have already been stripped away, so we receive a plain Unicode string.

158 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

11.5.11 Inclusion Tags

Another common template tag is the type that displays some data by rendering another template. For example,

Django’s admin interface uses custom template tags to display the buttons along the bottom of the “add/change” form

pages. Those buttons always look the same, but the link targets change depending on the object being edited. They’re

a perfect case for using a small template that is ﬁlled with details from the current object.

These sorts of tags are called inclusion tags. Writing inclusion tags is probably best demonstrated by example. Let’s

write a tag that produces a list of books for a given Author object. We’ll use the tag like this:

{% books_for_author author %}

The result will be something like this:

<ul>

<li>Green Eggs And Ham</li>

</ul>

First, we deﬁne the function that takes the argument and produces a dictionary of data for the result. Notice that

we need to return only a dictionary, not anything more complex. This will be used as the context for the template

fragment:

def books_for_author(author):

books =Book.objects.filter(authors__id=author.id)

return {’books’: books}

Next, we create the template used to render the tag’s output. Following our example, the template is very simple:

<ul>

{% for book in books %}

<li>{{ book.title }}</li>

{% endfor %}

</ul>

Finally, we create and register the inclusion tag by calling the inclusion_tag() method on a Library object.

Following our example, if the preceding template is in a ﬁle called book_snippet.html, we register the tag like

this:

Python 2.4 decorator syntax works as well, so we could have written this, instead:

@register.inclusion_tag(’book_snippet.html’)

def books_for_author(author):

# ...

Sometimes, your inclusion tags need access to values from the parent template’s context. To solve this, Django

provides a takes_context option for inclusion tags. If you specify takes_context in creating an inclusion

tag, the tag will have no required arguments, and the underlying Python function will have one argument: the template

context as of when the tag was called.

For example, say you’re writing an inclusion tag that will always be used in a context that contains home_link and

home_title variables that point back to the main page. Here’s what the Python function would look like:

@register.inclusion_tag(’link.html’, takes_context=True)

def jump_link(context):

return {

’link’: context[’home_link’],

11.5. Extending the Template System 159

The Django Book, Release 2.0

’title’: context[’home_title’],

}

(Note that the ﬁrst parameter to the function must be called context.)

The template link.html might contain the following:

Jump directly to <a href="{{ link }}">{{ title }}</a>.

Then, anytime you want to use that custom tag, load its library and call it without any arguments, like so:

{% jump_link %}

11.6 Writing Custom Template Loaders

Django’s built-in template loaders (described in the “Inside Template Loading” section above) will usually cover all

your template-loading needs, but it’s pretty easy to write your own if you need special loading logic. For example, you

could load templates from a database, or directly from a Subversion repository using Subversion’s Python bindings,

or (as shown shortly) from a ZIP archive.

A template loader – that is, each entry in the TEMPLATE_LOADERS setting – is expected to be a callable object with

this interface:

load_template_source(template_name, template_dirs=None)

The template_name argument is the name of the template to load (as passed to loader.get_template()

or loader.select_template()), and template_dirs is an optional list of directories to search instead of

TEMPLATE_DIRS.

If a loader is able to successfully load a template, it should return a tuple: (template_source,

template_path). Here, template_source is the template string that will be compiled by the template en-

gine, and template_path is the path the template was loaded from. That path might be shown to the user for

debugging purposes, so it should quickly identify where the template was loaded from.

If the loader is unable to load a template, it should raise django.template.TemplateDoesNotExist.

Each loader function should also have an is_usable function attribute. This is a Boolean that informs the template

engine whether this loader is available in the current Python installation. For example, the eggs loader (which is

capable of loading templates from Python eggs) sets is_usable to False if the pkg_resources module isn’t

installed, because pkg_resources is necessary to read data from eggs.

An example should help clarify all of this. Here’s a template loader function that can load templates from a ZIP ﬁle. It

uses a custom setting, TEMPLATE_ZIP_FILES, as a search path instead of TEMPLATE_DIRS, and it expects each

item on that path to be a ZIP ﬁle containing templates:

from django.conf import settings

from django.template import TemplateDoesNotExist

import zipfile

def load_template_source(template_name, template_dirs=None):

"Template loader that loads templates from a ZIP file."

template_zipfiles =getattr(settings, "TEMPLATE_ZIP_FILES", [])

# Try each ZIP file in TEMPLATE_ZIP_FILES.

for fname in template_zipfiles:

try:

z=zipfile.ZipFile(fname)

160 Chapter 11. Chapter 9: Advanced Templates

The Django Book, Release 2.0

source =z.read(template_name)

except (IOError,KeyError):

continue

z.close()

# We found a template, so return the source.

template_path ="%s:%s"%(fname, template_name)

return (source, template_path)

# If we reach here, the template couldn’t be loaded

raise TemplateDoesNotExist(template_name)

# This loader is always usable (since zipfile is included with Python)

load_template_source.is_usable =True

The only step left if we want to use this loader is to add it to the TEMPLATE_LOADERS setting. If we put this code in

a package called mysite.zip_loader, then we add mysite.zip_loader.load_template_source to

TEMPLATE_LOADERS.

11.7 Conﬁguring the Template System in Standalone Mode

Note: This section is only of interest to people trying to use the template system as an output component in another

application. If you are using the template system as part of a Django application, the information presented here

doesn’t apply to you.

Normally, Django loads all the conﬁguration information it needs from its own default conﬁguration ﬁle, combined

with the settings in the module given in the DJANGO_SETTINGS_MODULE environment variable. (This was ex-

plained in “A special Python prompt” in Chapter 4.) But if you’re using the template system independently of the

rest of Django, the environment variable approach isn’t very convenient, because you probably want to conﬁgure the

template system in line with the rest of your application rather than dealing with settings ﬁles and pointing to them via

environment variables.

To solve this problem, you need to use the manual conﬁguration option described fully in Appendix D. In a nutshell,

you need to import the appropriate pieces of the template system and then, before you call any of the template functions,

call django.conf.settings.configure() with any settings you wish to specify.

You might want to consider setting at least TEMPLATE_DIRS (if you are going to use template loaders),

DEFAULT_CHARSET (although the default of utf-8 is probably ﬁne) and TEMPLATE_DEBUG. All available set-

tings are described in Appendix D, and any setting starting with TEMPLATE_ is of obvious interest.

11.8 What’s Next

Continuing this section’s theme of advanced topics, the next chapter covers advanced usage of Django models.

11.7. Conﬁguring the Template System in Standalone Mode 161

The Django Book, Release 2.0

162 Chapter 11. Chapter 9: Advanced Templates

CHAPTER 12

Chapter 10: Advanced Models

In Chapter 5, we presented an introduction to Django’s database layer – how to deﬁne models and how to use the

database API to create, retrieve, update and delete records. In this chapter, we’ll introduce you to some more advanced

features of this part of Django.

12.1 Related Objects

Recall our book models from Chapter 5:

from django.db import models

class Publisher(models.Model):

name =models.CharField(max_length=30)

address =models.CharField(max_length=50)

city =models.CharField(max_length=60)

state_province =models.CharField(max_length=30)

country =models.CharField(max_length=50)

website =models.URLField()

def __unicode__(self):

return self.name

class Author(models.Model):

first_name =models.CharField(max_length=30)

last_name =models.CharField(max_length=40)

email =models.EmailField()

def __unicode__(self):

return u’%s %s’%(self.first_name, self.last_name)

class Book(models.Model):

title =models.CharField(max_length=100)

authors =models.ManyToManyField(Author)

publisher =models.ForeignKey(Publisher)

publication_date =models.DateField()

def __unicode__(self):

return self.title

As we explained in Chapter 5, accessing the value for a particular ﬁeld on a database object is as straightforward as

using an attribute. For example, to determine the title of the book with ID 50, we’d do the following:

163

The Django Book, Release 2.0

>>> from mysite.books.models import Book

>>> b=Book.objects.get(id=50)

>>> b.title

u’The Django Book’

But one thing we didn’t mention previously is that related objects – ﬁelds expressed as either a ForeignKey or

ManyToManyField – act slightly differently.

12.1.1 Accessing Foreign Key Values

When you access a ﬁeld that’s a ForeignKey, you’ll get the related model object. For example:

>>> b=Book.objects.get(id=50)

>>> b.publisher

<Publisher: Apress Publishing>

>>> b.publisher.website

u’http://www.apress.com/’

With ForeignKey ﬁelds, it works the other way, too, but it’s slightly different due to the non-symmetrical nature of

the relationship. To get a list of books for a given publisher, use publisher.book_set.all(), like this:

>>> p=Publisher.objects.get(name=’Apress Publishing’)

>>> p.book_set.all()

[<Book: The Django Book>, <Book: Dive Into Python>, ...]

Behind the scenes, book_set is just a QuerySet (as covered in Chapter 5), and it can be ﬁltered and sliced like

any other QuerySet. For example:

>>> p=Publisher.objects.get(name=’Apress Publishing’)

>>> p.book_set.filter(title__icontains=’django’)

[<Book: The Django Book>, <Book: Pro Django>]

The attribute name book_set is generated by appending the lower case model name to _set.

12.1.2 Accessing Many-to-Many Values

Many-to-many values work like foreign-key values, except we deal with QuerySet values instead of model instances.

For example, here’s how to view the authors for a book:

>>> b=Book.objects.get(id=50)

>>> b.authors.all()

[<Author: Adrian Holovaty>, <Author: Jacob Kaplan-Moss>]

>>> b.authors.filter(first_name=’Adrian’)

[<Author: Adrian Holovaty>]

>>> b.authors.filter(first_name=’Adam’)

[]

It works in reverse, too. To view all of the books for an author, use author.book_set, like this:

>>> a=Author.objects.get(first_name=’Adrian’, last_name=’Holovaty’)

>>> a.book_set.all()

[<Book: The Django Book>, <Book: Adrian’s Other Book>]

Here, as with ForeignKey ﬁelds, the attribute name book_set is generated by appending the lower case model

name to _set.

164 Chapter 12. Chapter 10: Advanced Models

The Django Book, Release 2.0

12.2 Making Changes to a Database Schema

When we introduced the syncdb command in Chapter 5, we noted that syncdb merely creates tables that don’t yet

exist in your database – it does not sync changes in models or perform deletions of models. If you add or change

a model’s ﬁeld, or if you delete a model, you’ll need to make the change in your database manually. This section

explains how to do that.

When dealing with schema changes, it’s important to keep a few things in mind about how Django’s database layer

works:

• Django will complain loudly if a model contains a ﬁeld that has not yet been created in the database table. This

will cause an error the ﬁrst time you use the Django database API to query the given table (i.e., it will happen at

code execution time, not at compilation time).

• Django does not care if a database table contains columns that are not deﬁned in the model.

• Django does not care if a database contains a table that is not represented by a model.

Making schema changes is a matter of changing the various pieces – the Python code and the database itself – in the

right order.

12.2.1 Adding Fields

When adding a ﬁeld to a table/model in a production setting, the trick is to take advantage of the fact that Django

doesn’t care if a table contains columns that aren’t deﬁned in the model. The strategy is to add the column in the

database, and then update the Django model to include the new ﬁeld.

However, there’s a bit of a chicken-and-egg problem here, because in order to know how the new database column

should be expressed in SQL, you need to look at the output of Django’s manage.py sqlall command, which

requires that the ﬁeld exist in the model. (Note that you’re not required to create your column with exactly the same

SQL that Django would, but it’s a good idea to do so, just to be sure everything’s in sync.)

The solution to the chicken-and-egg problem is to use a development environment instead of making the changes on a

production server. (You are using a testing/development environment, right?) Here are the detailed steps to take.

First, take these steps in the development environment (i.e., not on the production server):

1. Add the ﬁeld to your model.

2. Run manage.py sqlall [yourapp] to see the new CREATE TABLE statement for the model. Note the

column deﬁnition for the new ﬁeld.

3. Start your database’s interactive shell (e.g., psql or mysql, or you can use manage.py dbshell). Execute

an ALTER TABLE statement that adds your new column.

4. Launch the Python interactive shell with manage.py shell and verify that the new ﬁeld was added properly

by importing the model and selecting from the table (e.g., MyModel.objects.all()[:5]). If you updated

the database correctly, the statement should work without errors.

Then on the production server perform these steps:

1. Start your database’s interactive shell.

2. Execute the ALTER TABLE statement you used in step 3 of the development environment steps.

3. Add the ﬁeld to your model. If you’re using source-code revision control and you checked in your change in

development environment step 1, now is the time to update the code (e.g., svn update, with Subversion) on

the production server.

4. Restart the Web server for the code changes to take effect.

12.2. Making Changes to a Database Schema 165

The Django Book, Release 2.0

For example, let’s walk through what we’d do if we added a num_pages ﬁeld to the Book model from Chapter 5.

First, we’d alter the model in our development environment to look like this:

class Book(models.Model):

title = models.CharField(max_length=100)

authors = models.ManyToManyField(Author)

publisher = models.ForeignKey(Publisher)

publication_date = models.DateField()

num_pages = models.IntegerField(blank=True, null=True)

def __unicode__(self):

return self.title

(Note: Read the section “Making Fields Optional” in Chapter 6, plus the sidebar “Adding NOT NULL Columns”

below for important details on why we included blank=True and null=True.)

Then we’d run the command manage.py sqlall books to see the CREATE TABLE statement. Depending on

your database backend, it would look something like this:

CREATE TABLE "books_book" (

"id" serial NOT NULL PRIMARY KEY,

"title" varchar(100) NOT NULL,

"publisher_id" integer NOT NULL REFERENCES "books_publisher" ("id"),

"publication_date" date NOT NULL,

"num_pages" integer NULL

);

The new column is represented like this:

"num_pages" integer NULL

Next, we’d start the database’s interactive shell for our development database by typing psql (for PostgreSQL), and

we’d execute the following statements:

ALTER TABLE books_book ADD COLUMN num_pages integer;

Adding NOT NULL Columns

There’s a subtlety here that deserves mention. When we added the num_pages ﬁeld to our model, we included the

blank=True and null=True options. We did this because a database column will contain NULL values when

you ﬁrst create it.

However, it’s also possible to add columns that cannot contain NULL values. To do this, you have to create the column

as NULL, then populate the column’s values using some default(s), and then alter the column to set the NOT NULL

modiﬁer. For example:

BEGIN;

ALTER TABLE books_book ADD COLUMN num_pages integer;

UPDATE books_book SET num_pages=0;

ALTER TABLE books_book ALTER COLUMN num_pages SET NOT NULL;

COMMIT;

If you go down this path, remember that you should leave off blank=True and null=True in your model (obvi-

ously).

After the ALTER TABLE statement, we’d verify that the change worked properly by starting the Python shell and

running this code:

166 Chapter 12. Chapter 10: Advanced Models

The Django Book, Release 2.0

>>> from mysite.books.models import Book

>>> Book.objects.all()[:5]

If that code didn’t cause errors, we’d switch to our production server and execute the ALTER TABLE statement on

the production database. Then, we’d update the model in the production environment and restart the Web server.

12.2.2 Removing Fields

Removing a ﬁeld from a model is a lot easier than adding one. To remove a ﬁeld, just follow these steps:

1. Remove the ﬁeld from your model and restart the Web server.

2. Remove the column from your database, using a command like this:

ALTER TABLE books_book DROP COLUMN num_pages;

Make sure to do it in this order. If you remove the column from your database ﬁrst, Django will immediately begin

raising errors.

12.2.3 Removing Many-to-Many Fields

Because many-to-many ﬁelds are different than normal ﬁelds, the removal process is different:

1. Remove the ManyToManyField from your model and restart the Web server.

2. Remove the many-to-many table from your database, using a command like this:

DROP TABLE books_book_authors;

As in the previous section, make sure to do it in this order.

12.2.4 Removing Models

Removing a model entirely is as easy as removing a ﬁeld. To remove a model, just follow these steps:

1. Remove the model from your models.py ﬁle and restart the Web server.

2. Remove the table from your database, using a command like this:

DROP TABLE books_book;

Note that you might need to remove any dependent tables from your database ﬁrst – e.g., any tables that have

foreign keys to books_book.

As in the previous sections, make sure to do it in this order.

12.3 Managers

In the statement Book.objects.all(),objects is a special attribute through which you query your database.

In Chapter 5, we brieﬂy identiﬁed this as the model’s manager. Now it’s time to dive a bit deeper into what managers

are and how you can use them.

In short, a model’s manager is an object through which Django models perform database queries. Each Django model

has at least one manager, and you can create custom managers in order to customize database access.

12.3. Managers 167

The Django Book, Release 2.0

There are two reasons you might want to create a custom manager: to add extra manager methods, and/or to modify

the initial QuerySet the manager returns.

12.3.1 Adding Extra Manager Methods

Adding extra manager methods is the preferred way to add “table-level” functionality to your models. (For “row-level”

functionality – i.e., functions that act on a single instance of a model object – use model methods, which are explained

later in this chapter.)

For example, let’s give our Book model a manager method title_count() that takes a keyword and returns the

number of books that have a title containing that keyword. (This example is slightly contrived, but it demonstrates

how managers work.)

# models.py

from django.db import models

# ... Author and Publisher models here ...

class BookManager(models.Manager):

def title_count(self, keyword):

return self.filter(title__icontains=keyword).count()

class Book(models.Model):

title = models.CharField(max_length=100)

authors = models.ManyToManyField(Author)

publisher = models.ForeignKey(Publisher)

publication_date = models.DateField()

num_pages = models.IntegerField(blank=True, null=True)

objects = BookManager()

def __unicode__(self):

return self.title

With this manager in place, we can now do this:

>>> Book.objects.title_count(’django’)

>>> Book.objects.title_count(’python’)

Here are some notes about the code:

• We’ve created a BookManager class that extends django.db.models.Manager. This has a single

method, title_count(), which does the calculation. Note that the method uses self.filter(), where

self refers to the manager itself.

• We’ve assigned BookManager() to the objects attribute on the model. This has the effect of replacing the

“default” manager for the model, which is called objects and is automatically created if you don’t specify

a custom manager. We call it objects rather than something else, so as to be consistent with automatically

created managers.

Why would we want to add a method such as title_count()? To encapsulate commonly executed queries so that

we don’t have to duplicate code.

168 Chapter 12. Chapter 10: Advanced Models

The Django Book, Release 2.0

12.3.2 Modifying Initial Manager QuerySets

A manager’s base QuerySet returns all objects in the system. For example, Book.objects.all() returns all

books in the book database.

You can override a manager’s base QuerySet by overriding the Manager.get_query_set() method.

get_query_set() should return a QuerySet with the properties you require.

For example, the following model has two managers – one that returns all objects, and one that returns only the books

by Roald Dahl.

from django.db import models

# First, define the Manager subclass.

class DahlBookManager(models.Manager):

def get_query_set(self):

return super(DahlBookManager, self).get_query_set().filter(author=’Roald Dahl’)

# Then hook it into the Book model explicitly.

class Book(models.Model):

title = models.CharField(max_length=100)

author = models.CharField(max_length=50)

# ...

objects = models.Manager() # The default manager.

dahl_objects = DahlBookManager() # The Dahl-specific manager.

With this sample model, Book.objects.all() will return all books in the database, but

Book.dahl_objects.all() will only return the ones written by Roald Dahl. Note that we explicitly

set objects to a vanilla Manager instance, because if we hadn’t, the only available manager would be

dahl_objects.

Of course, because get_query_set() returns a QuerySet object, you can use filter(),exclude() and

all the other QuerySet methods on it. So these statements are all legal:

Book.dahl_objects.all()

Book.dahl_objects.filter(title=’Matilda’)

Book.dahl_objects.count()

This example also pointed out another interesting technique: using multiple managers on the same model. You can

attach as many Manager() instances to a model as you’d like. This is an easy way to deﬁne common “ﬁlters” for

your models.

For example:

class MaleManager(models.Manager):

def get_query_set(self):

return super(MaleManager, self).get_query_set().filter(sex=’M’)

class FemaleManager(models.Manager):

def get_query_set(self):

return super(FemaleManager, self).get_query_set().filter(sex=’F’)

class Person(models.Model):

first_name =models.CharField(max_length=50)

last_name =models.CharField(max_length=50)

sex =models.CharField(max_length=1, choices=((’M’,’Male’), (’F’,’Female’)))

people =models.Manager()

12.3. Managers 169

The Django Book, Release 2.0

men =MaleManager()

women =FemaleManager()

This example allows you to request Person.men.all(),Person.women.all(), and

Person.people.all(), yielding predictable results.

If you use custom Manager objects, take note that the ﬁrst Manager Django encounters (in the order in which they’re

deﬁned in the model) has a special status. Django interprets this ﬁrst Manager deﬁned in a class as the “default”

Manager, and several parts of Django (though not the admin application) will use that Manager exclusively for that

model. As a result, it’s often a good idea to be careful in your choice of default manager, in order to avoid a situation

where overriding of get_query_set() results in an inability to retrieve objects you’d like to work with.

12.4 Model methods

Deﬁne custom methods on a model to add custom “row-level” functionality to your objects. Whereas managers are

intended to do “table-wide” things, model methods should act on a particular model instance.

This is a valuable technique for keeping business logic in one place – the model.

An example is the easiest way to explain this. Here’s a model with a few custom methods:

from django.contrib.localflavor.us.models import USStateField

from django.db import models

class Person(models.Model):

first_name =models.CharField(max_length=50)

last_name =models.CharField(max_length=50)

birth_date =models.DateField()

address =models.CharField(max_length=100)

city =models.CharField(max_length=50)

state =USStateField() # Yes, this is U.S.-centric...

def baby_boomer_status(self):

"Returns the person’s baby-boomer status."

import datetime

if datetime.date(1945,8,1)<= self.birth_date <= datetime.date(1964,12,31):

return "Baby boomer"

if self.birth_date <datetime.date(1945,8,1):

return "Pre-boomer"

return "Post-boomer"

def is_midwestern(self):

"Returns True if this person is from the Midwest."

return self.state in (’IL’,’WI’,’MI’,’IN’,’OH’,’IA’,’MO’)

def _get_full_name(self):

"Returns the person’s full name."

return u’%s %s’%(self.first_name, self.last_name)

full_name =property(_get_full_name)

The last method in this example is a “property.” Read more about properties at

http://www.python.org/download/releases/2.2/descrintro/#property

And here’s example usage:

>>> p=Person.objects.get(first_name=’Barack’, last_name=’Obama’)

>>> p.birth_date

170 Chapter 12. Chapter 10: Advanced Models

The Django Book, Release 2.0

datetime.date(1961, 8, 4)

>>> p.baby_boomer_status()

’Baby boomer’

>>> p.is_midwestern()

True

>>> p.full_name # Note this isn’t a method -- it’s treated as an attribute

u’Barack Obama’

12.5 Executing Raw SQL Queries

Sometimes you’ll ﬁnd that the Django database API can only take you so far, and you’ll want to write custom SQL

queries against your database. You can do this very easily by accessing the object django.db.connection,

which represents the current database connection. To use it, call connection.cursor() to get a cursor ob-

ject. Then, call cursor.execute(sql, [params]) to execute the SQL and cursor.fetchone() or

cursor.fetchall() to return the resulting rows. For example:

>>> from django.db import connection

>>> cursor =connection.cursor()

>>> cursor.execute("""

... SELECT DISTINCT first_name

... FROM people_person

... WHERE last_name = %s""", [’Lennon’])

>>> row =cursor.fetchone()

>>> print row

[’John’]

connection and cursor mostly implement the standard Python “DB-API,” which you can read about at

http://www.python.org/peps/pep-0249.html. If you’re not familiar with the Python DB-API, note that the SQL state-

ment in cursor.execute() uses placeholders, "%s", rather than adding parameters directly within the SQL. If

you use this technique, the underlying database library will automatically add quotes and escaping to your parameter(s)

as necessary.

Rather than littering your view code with these django.db.connection statements, it’s a good idea to put them

in custom model methods or manager methods. For example, the above example could be integrated into a custom

manager method like this:

from django.db import connection, models

class PersonManager(models.Manager):

def first_names(self, last_name):

cursor =connection.cursor()

cursor.execute("""

SELECT DISTINCT first_name

FROM people_person

WHERE last_name = %s""", [last_name])

return [row[0]for row in cursor.fetchone()]

class Person(models.Model):

first_name =models.CharField(max_length=50)

last_name =models.CharField(max_length=50)

objects =PersonManager()

And sample usage:

>>> Person.objects.first_names(’Lennon’)

[’John’, ’Cynthia’]

12.5. Executing Raw SQL Queries 171

The Django Book, Release 2.0

12.6 What’s Next?

In the next chapter, we’ll show you Django’s “generic views” framework, which lets you save time in building Web

sites that follow common patterns.

172 Chapter 12. Chapter 10: Advanced Models

CHAPTER 13

Chapter 11: Generic Views

Here again is a recurring theme of this book: at its worst, Web development is boring and monotonous. So far, we’ve

covered how Django tries to take away some of that monotony at the model and template layers, but Web developers

also experience this boredom at the view level.

Django’s generic views were developed to ease that pain. They take certain common idioms and patterns found in

view development and abstract them so that you can quickly write common views of data without having to write too

much code. In fact, nearly every view example in the preceding chapters could be rewritten with the help of generic

views.

Chapter 8 touched brieﬂy on how you’d go about making a view “generic.” To review, we can recognize certain

common tasks, like displaying a list of objects, and write code that displays a list of any object. Then the model in

question can be passed as an extra argument to the URLconf.

Django ships with generic views to do the following:

• Perform common “simple” tasks: redirect to a different page, or render a given template.

• Display “list” and “detail” pages for a single object. The event_list and entry_list views from Chapter

8 are examples of list views. A single event page is an example of what we call a “detail” view.

• Present date-based objects in year/month/day archive pages, associated detail, and “latest” pages. The Django

Weblog’s (http://www.djangoproject.com/weblog/) year, month, and day archives are built with these, as would

be a typical newspaper’s archives.

Taken together, these views provide easy interfaces to perform the most common tasks developers encounter.

13.1 Using Generic Views

All of these views are used by creating conﬁguration dictionaries in your URLconf ﬁles and passing those dictionaries

as the third member of the URLconf tuple for a given pattern. (See “Passing Extra Options to View Functions” in

Chapter 8 for an overview of this technique.)

For example, here’s a simple URLconf you could use to present a static “about” page:

from django.conf.urls.defaults import *

from django.views.generic.simple import direct_to_template

urlpatterns =patterns(’’,

(r’^about/$’, direct_to_template, {

’template’:’about.html’

})

)

173

The Django Book, Release 2.0

Though this might seem a bit “magical” at ﬁrst glance – look, a view with no code! –, it’s actually exactly the same as

the examples in Chapter 8: the direct_to_template view simply grabs information from the extra-parameters

dictionary and uses that information when rendering the view.

Because this generic view – and all the others – is a regular view function like any other, we can reuse it inside our

own views. As an example, let’s extend our “about” example to map URLs of the form /about/<whatever>/ to

statically rendered about/<whatever>.html. We’ll do this by ﬁrst modifying the URLconf to point to a view

function:

from django.conf.urls.defaults import *

from django.views.generic.simple import direct_to_template

from mysite.books.views import about_pages

urlpatterns = patterns(‘’,

(r’^about/$’, direct_to_template, {

‘template’: ‘about.html’

}),

(r’^about/(\w+)/$’, about_pages),

)

Next, we’ll write the about_pages view:

from django.http import Http404

from django.template import TemplateDoesNotExist

from django.views.generic.simple import direct_to_template

def about_pages(request, page):

try:

return direct_to_template(request, template="about/%s.html" %page)

except TemplateDoesNotExist:

raise Http404()

Here we’re treating direct_to_template like any other function. Since it returns an HttpResponse, we

can simply return it as-is. The only slightly tricky business here is dealing with missing templates. We don’t want

a nonexistent template to cause a server error, so we catch TemplateDoesNotExist exceptions and return 404

errors instead.

Is There a Security Vulnerability Here?

Sharp-eyed readers may have noticed a possible security hole: we’re constructing the template name using interpolated

content from the browser (template="about/%s.html" % page). At ﬁrst glance, this looks like a classic

directory traversal vulnerability (discussed in detail in Chapter 20). But is it really?

Not exactly. Yes, a maliciously crafted value of page could cause directory traversal, but although page is taken

from the request URL, not every value will be accepted. The key is in the URLconf: we’re using the regular expression

\w+ to match the page part of the URL, and \w only accepts letters and numbers. Thus, any malicious characters

(such as dots and slashes) will be rejected by the URL resolver before they reach the view itself.

13.2 Generic Views of Objects

The direct_to_template view certainly is useful, but Django’s generic views really shine when it comes to

presenting views on your database content. Because it’s such a common task, Django comes with a handful of built-in

generic views that make generating list and detail views of objects incredibly easy.

Let’s take a look at one of these generic views: the “object list” view. We’ll be using this Publisher object from

Chapter 5:

174 Chapter 13. Chapter 11: Generic Views

The Django Book, Release 2.0

class Publisher(models.Model):

name =models.CharField(max_length=30)

address =models.CharField(max_length=50)

city =models.CharField(max_length=60)

state_province =models.CharField(max_length=30)

country =models.CharField(max_length=50)

website =models.URLField()

def __unicode__(self):

return self.name

class Meta:

ordering =[’name’]

To build a list page of all publishers, we’d use a URLconf along these lines:

from django.conf.urls.defaults import *

from django.views.generic import list_detail

from mysite.books.models import Publisher

publisher_info ={

’queryset’: Publisher.objects.all(),

}

urlpatterns =patterns(’’,

(r’^publishers/$’, list_detail.object_list, publisher_info)

)

That’s all the Python code we need to write. We still need to write a template, however. We can explicitly tell the

object_list view which template to use by including a template_name key in the extra arguments dictionary:

from django.conf.urls.defaults import *

from django.views.generic import list_detail

from mysite.books.models import Publisher

publisher_info = {

‘queryset’: Publisher.objects.all(),

‘template_name’: ‘publisher_list_page.html’,

}

urlpatterns = patterns(‘’,

(r’^publishers/$’, list_detail.object_list, publisher_info)

)

In the absence of template_name, though, the object_list generic view will infer one from the object’s name.

In this case, the inferred template will be "books/publisher_list.html" – the “books” part comes from the

name of the app that deﬁnes the model, while the “publisher” bit is just the lowercased version of the model’s name.

This template will be rendered against a context containing a variable called object_list that contains all the

publisher objects. A very simple template might look like the following:

{% extends "base.html" %}

{% block content %}

<h2>Publishers</h2>

<ul>

{% for publisher in object_list %}

<li>{{ publisher.name }}</li>

{% endfor %}

13.2. Generic Views of Objects 175

The Django Book, Release 2.0

</ul>

{% endblock %}

(Note that this assumes the existence of a base.html template, as we provided in an example in Chapter 4.)

That’s really all there is to it. All the cool features of generic views come from changing the “info” dictionary passed

to the generic view. Appendix C documents all the generic views and all their options in detail; the rest of this chapter

will consider some of the common ways you might customize and extend generic views.

13.3 Extending Generic Views

There’s no question that using generic views can speed up development substantially. In most projects, however, there

comes a moment when the generic views no longer sufﬁce. Indeed, one of the most common questions asked by new

Django developers is how to make generic views handle a wider array of situations.

Luckily, in nearly every one of these cases, there are ways to simply extend generic views to handle a larger array of

use cases. These situations usually fall into a handful of patterns dealt with in the sections that follow.

13.3.1 Making “Friendly” Template Contexts

You might have noticed that sample publisher list template stores all the books in a variable named object_list.

While this works just ﬁne, it isn’t all that “friendly” to template authors: they have to “just know” that they’re dealing

with books here. A better name for that variable would be publisher_list; that variable’s content is pretty

obvious.

We can change the name of that variable easily with the template_object_name argument:

from django.conf.urls.defaults import *

from django.views.generic import list_detail

from mysite.books.models import Publisher

publisher_info ={

’queryset’: Publisher.objects.all(),

’template_name’:’publisher_list_page.html’,

’template_object_name’:’publisher’,

}

urlpatterns =patterns(’’,

(r’^publishers/$’, list_detail.object_list, publisher_info)

)

In the template, the generic view will append _list to the template_object_name to create the variable name

representing the list of items.

Providing a useful template_object_name is always a good idea. Your coworkers who design templates will

thank you.

13.3.2 Adding Extra Context

Sometimes, you might need to present some extra information beyond that provided by the generic view. For example,

think of showing a list of all the other publishers on each publisher detail page. The object_detail generic view

provides the publisher to the context, but it seems there’s no way to get a list of all publishers in that template.

176 Chapter 13. Chapter 11: Generic Views

The Django Book, Release 2.0

But there is: all generic views take an extra optional parameter, extra_context. This is a dictionary of extra

objects that will be added to the template’s context. So, to provide the list of all publishers on the detail view, we’d

use an info dictionary like this:

publisher_info = {

‘queryset’: Publisher.objects.all(),

‘template_object_name’: ‘publisher’,

‘extra_context’: {‘book_list’: Book.objects.all()}

}

This would populate a {{ book_list }} variable in the template context. This pattern can be used to pass any

information down into the template for the generic view. It’s very handy.

However, there’s actually a subtle bug here – can you spot it?

The problem has to do with when the queries in extra_context are evaluated. Because this example puts

Book.objects.all() in the URLconf, it will be evaluated only once (when the URLconf is ﬁrst loaded). Once

you add or remove publishers, you’ll notice that the generic view doesn’t reﬂect those changes until you reload the

Web server (see “Caching and QuerySets” in Appendix B for more information about when QuerySet objects are

cached and evaluated).

Note: This problem doesn’t apply to the queryset generic view argument. Since Django knows that particular

QuerySet should never be cached, the generic view takes care of clearing the cache when each view is rendered.

The solution is to use a callback in extra_context instead of a value. Any callable (i.e., a function) that’s passed

to extra_context will be evaluated when the view is rendered (instead of only once). You could do this with an

explicitly deﬁned function:

def get_books():

return Book.objects.all()

publisher_info = {

‘queryset’: Publisher.objects.all(),

‘template_object_name’: ‘publisher’,

‘extra_context’: {‘book_list’: get_books}

}

Or, you could use a less obvious but shorter version that relies on the fact that Book.objects.all is itself a

callable:

publisher_info = {

‘queryset’: Publisher.objects.all(),

‘template_object_name’: ‘publisher’,

‘extra_context’: {‘book_list’: Book.objects.all}

}

Notice the lack of parentheses after Book.objects.all. This references the function without actually calling it

(which the generic view will do later).

13.3.3 Viewing Subsets of Objects

Now let’s take a closer look at this queryset key we’ve been using all along. Most generic views take one of these

queryset arguments – it’s how the view knows which set of objects to display (see “Selecting Objects” in Chapter

5 for an introduction to QuerySet objects, and see Appendix B for the complete details).

To pick a simple example, we might want to order a list of books by publication date, with the most recent ﬁrst:

13.3. Extending Generic Views 177

The Django Book, Release 2.0

book_info = {

‘queryset’: Book.objects.order_by(‘-publication_date’),

}

urlpatterns = patterns(‘’,

(r’^publishers/$’, list_detail.object_list, publisher_info),

(r’^books/$’, list_detail.object_list, book_info),

)

That’s a pretty simple example, but it illustrates the idea nicely. Of course, you’ll usually want to do more than just

reorder objects. If you want to present a list of books by a particular publisher, you can use the same technique:

apress_books = {

‘queryset’: Book.objects.filter(publisher__name=’Apress Publishing’),

‘template_name’: ‘books/apress_list.html’

}

urlpatterns = patterns(‘’,

(r’^publishers/$’, list_detail.object_list, publisher_info),

(r’^books/apress/$’, list_detail.object_list, apress_books),

)

Notice that along with a ﬁltered queryset, we’re also using a custom template name. If we didn’t, the generic view

would use the same template as the “vanilla” object list, which might not be what we want.

Also notice that this isn’t a very elegant way of doing publisher-speciﬁc books. If we want to add another publisher

page, we’d need another handful of lines in the URLconf, and more than a few publishers would get unreasonable.

We’ll deal with this problem in the next section.

13.3.4 Complex Filtering with Wrapper Functions

Another common need is to ﬁlter the objects given in a list page by some key in the URL. Earlier we hard-coded the

publisher’s name in the URLconf, but what if we wanted to write a view that displayed all the books by some arbitrary

publisher? The solution is to “wrap” the object_list generic view to avoid writing a lot of code by hand. As

usual, we’ll start by writing a URLconf:

urlpatterns = patterns(‘’,

(r’^publishers/$’, list_detail.object_list, publisher_info),

(r’^books/(\w+)/$’, books_by_publisher),

)

Next, we’ll write the books_by_publisher view itself:

from django.shortcuts import get_object_or_404

from django.views.generic import list_detail

from mysite.books.models import Book, Publisher

def books_by_publisher(request, name):

# Look up the publisher (and raise a 404 if it can’t be found).

publisher =get_object_or_404(Publisher, name__iexact=name)

# Use the object_list view for the heavy lifting.

return list_detail.object_list(

request,

queryset =Book.objects.filter(publisher=publisher),

template_name =’books/books_by_publisher.html’,

178 Chapter 13. Chapter 11: Generic Views

The Django Book, Release 2.0

template_object_name =’book’,

extra_context ={’publisher’: publisher}

)

This works because there’s really nothing special about generic views – they’re just Python functions. Like any view

function, generic views expect a certain set of arguments and return HttpResponse objects. Thus, it’s incredibly

easy to wrap a small function around a generic view that does additional work before (or after; see the next section)

handing things off to the generic view.

Note: Notice that in the preceding example we passed the current publisher being displayed in the extra_context.

This is usually a good idea in wrappers of this nature; it lets the template know which “parent” object is currently being

browsed.

13.3.5 Performing Extra Work

The last common pattern we’ll look at involves doing some extra work before or after calling the generic view.

Imagine we had a last_accessed ﬁeld on our Author object that we were using to keep track of the last time

anybody looked at that author. The generic object_detail view, of course, wouldn’t know anything about this

ﬁeld, but once again we could easily write a custom view to keep that ﬁeld updated.

First, we’d need to add an author detail bit in the URLconf to point to a custom view:

from mysite.books.views import author_detail

urlpatterns = patterns(‘’,

# ...

(r’^authors/(?P<author_id>\d+)/$’, author_detail),

# ...

)

Then we’d write our wrapper function:

import datetime

from django.shortcuts import get_object_or_404

from django.views.generic import list_detail

from mysite.books.models import Author

def author_detail(request, author_id):

# Delegate to the generic view and get an HttpResponse.

response =list_detail.object_detail(

request,

queryset =Author.objects.all(),

object_id =author_id,

)

# Record the last accessed date. We do this *after*the call

# to object_detail(), not before it, so that this won’t be called

# unless the Author actually exists. (If the author doesn’t exist,

# object_detail() will raise Http404, and we won’t reach this point.)

now =datetime.datetime.now()

Author.objects.filter(id=author_id).update(last_accessed=now)

return response

Note: This code won’t actually work unless you add a last_accessed ﬁeld to your Author model and create a

13.3. Extending Generic Views 179

The Django Book, Release 2.0

books/author_detail.html template.

We can use a similar idiom to alter the response returned by the generic view. If we wanted to provide a downloadable

plain-text version of the list of authors, we could use a view like this:

def author_list_plaintext(request):

response =list_detail.object_list(

request,

queryset =Author.objects.all(),

mimetype =’text/plain’,

template_name =’books/author_list.txt’

)

response["Content-Disposition"]="attachment; filename=authors.txt"

return response

This works because the generic views return simple HttpResponse objects that can be treated like dictionaries to

set HTTP headers. This Content-Disposition business, by the way, instructs the browser to download and save

the page instead of displaying it in the browser.

13.4 What’s Next?

In this chapter we looked at only a couple of the generic views Django ships with, but the general ideas presented

here should apply pretty closely to any generic view. Appendix C covers all the available views in detail, and it’s

recommended reading if you want to get the most out of this powerful feature.

This concludes the section of this book devoted to “advanced usage.” In the next chapter, we cover deployment of

Django applications.

180 Chapter 13. Chapter 11: Generic Views

CHAPTER 14

Chapter 12: Deploying Django

This chapter covers the last essential step of building a Django application: deploying it to a production server.

If you’ve been following along with our ongoing examples, you’ve likely been using the runserver, which makes

things very easy – with runserver, you don’t have to worry about Web server setup. But runserver is intended

only for development on your local machine, not for exposure on the public Web. To deploy your Django application,

you’ll need to hook it into an industrial-strength Web server such as Apache. In this chapter, we’ll show you how to

do that – but, ﬁrst, we’ll give you a checklist of things to do in your codebase before you go live.

14.1 Preparing Your Codebase for Production

Fortunately, the runserver approximates a “real” Web server closely enough that not very many changes need to

be made to a Django application in order to make it production-ready. But there are a few essential things you should

do before you turn the switch.

14.1.1 Turning Off Debug Mode

When we created a project in Chapter 2, the command django-admin.py startproject created a

settings.py ﬁle with DEBUG set to True. Many internal parts of Django check this setting and change their

behavior if DEBUG mode is on. For example, if DEBUG is set to True, then:

• All database queries will be saved in memory as the object django.db.connection.queries. As you

can imagine, this eats up memory!

• Any 404 error will be rendered by Django’s special 404 error page (covered in Chapter 3) rather than returning

a proper 404 response. This page contains potentially sensitive information and should not be exposed to the

public Internet.

• Any uncaught exception in your Django application – from basic Python syntax errors to database errors to

template syntax errors – will be rendered by the Django pretty error page that you’ve likely come to know and

love. This page contains even more sensitive information than the 404 page and should never be exposed to the

public.

In short, setting DEBUG to True tells Django to assume only trusted developers are using your site. The Internet is full

of untrustworthy hooligans, and the ﬁrst thing you should do when you’re preparing your application for deployment

is set DEBUG to False.

181

The Django Book, Release 2.0

14.1.2 Turning Off Template Debug Mode

Similarly, you should set TEMPLATE_DEBUG to False in production. If True, this setting tells Django’s template

system to save some extra information about every template, for use on the pretty error pages.

14.1.3 Implementing a 404 Template

If DEBUG is True, Django displays the useful 404 error page. But if DEBUG is False, then it does something

different: it renders a template called 404.html in your root template directory. So, when you’re ready to deploy,

you’ll need to create this template and put a useful “Page not found” message in it.

Here’s a sample 404.html you can use as a starting point. It assumes you’re using template inheritance and have

deﬁned a base.html with blocks called title and content.

{% extends "base.html" %}

{% block title %}Page not found{% endblock %}

{% block content %}

<h1>Page not found</h1>

<p>Sorry, but the requested page could not be found.</p>

{% endblock %}

To test that your 404.html is working, just change DEBUG to False and visit a nonexistent URL. (This works on

the runserver just as well as it works on a production server.)

14.1.4 Implementing a 500 Template

Similarly, if DEBUG is False, then Django no longer displays its useful error/traceback pages in case of an unhandled

Python exception. Instead, it looks for a template called 500.html and renders it. Like 404.html, this template

should live in your root template directory.

There’s one slightly tricky thing about 500.html. You can never be sure why this template is being rendered, so it

shouldn’t do anything that requires a database connection or relies on any potentially broken part of your infrastructure.

(For example, it should not use custom template tags.) If it uses template inheritance, then the parent template(s)

shouldn’t rely on potentially broken infrastructure, either. Therefore, the best approach is to avoid template inheritance

and use something very simple. Here’s an example 500.html as a starting point:

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN"

"http://www.w3.org/TR/html4/strict.dtd">

<head>

<title>Page unavailable</title>

</head>

<body>

<h1>Page unavailable</h1>

<p>Sorry, but the requested page is unavailable due to a

server hiccup.</p>

<p>Our engineers have been notified, so check back later.</p>

</body>

</html>

182 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

14.1.5 Setting Up Error Alerts

When your Django-powered site is running and an exception is raised, you’ll want to know about it, so you can ﬁx

it. By default, Django is conﬁgured to send an e-mail to the site developers whenever your code raises an unhandled

exception – but you need to do two things to set it up.

First, change your ADMINS setting to include your e-mail address, along with the e-mail addresses of any other people

who need to be notiﬁed. This setting takes a tuple of (name, email) tuples, like this:

ADMINS =(

(’John Lennon’,’jlennon@example.com’),

(’Paul McCartney’,’pmacca@example.com’),

)

Second, make sure your server is conﬁgured to send e-mail. Setting up postfix,sendmail or any other mail server

is outside the scope of this book, but on the Django side of things, you’ll want to make sure your EMAIL_HOST setting

is set to the proper hostname for your mail server. It’s set to ’localhost’ by default, which works out of the box for

most shared-hosting environments. You might also need to set EMAIL_HOST_USER,EMAIL_HOST_PASSWORD,

EMAIL_PORT or EMAIL_USE_TLS, depending on the complexity of your arrangement.

Also, you can set EMAIL_SUBJECT_PREFIX to control the preﬁx Django uses in front of its error e-mails. It’s set

to ’[Django] ’ by default.

14.1.6 Setting Up Broken Link Alerts

If you have the CommonMiddleware installed (e.g., if your MIDDLEWARE_CLASSES setting includes

’django.middleware.common.CommonMiddleware’, which it does by default), then you have the option

of receiving an e-mail any time somebody visits a page on your Django-powered site that raises 404 with a non-empty

referrer – that is, every broken link. If you want to activate this feature, set SEND_BROKEN_LINK_EMAILS to True

(it’s False by default), and set your MANAGERS setting to a person or people who will receive these broken-link e-

mails. MANAGERS uses the same syntax as ADMINS. For example:

MANAGERS =(

(’George Harrison’,’gharrison@example.com’),

(’Ringo Starr’,’ringo@example.com’),

)

Note that error e-mails can get annoying; they’re not for everybody.

14.2 Using Different Settings for Production

So far in this book, we’ve dealt with only a single settings ﬁle: the settings.py generated by

django-admin.py startproject. But as you get ready to deploy, you’ll likely ﬁnd yourself needing mul-

tiple settings ﬁles to keep your development environment isolated from your production environment. (For example,

you probably won’t want to change DEBUG from False to True whenever you want to test code changes on your

local machine.) Django makes this very easy by allowing you to use multiple settings ﬁles.

If you’d like to organize your settings ﬁles into “production” and “development” settings, you can accomplish this in

one of three ways:

• Set up two full-blown, independent settings ﬁles.

• Set up a “base” settings ﬁle (say, for development) and a second (say, production) settings ﬁle that merely

imports from the ﬁrst one and deﬁnes whatever overrides it needs to deﬁne.

• Use only a single settings ﬁle that has Python logic to change the settings based on context.

14.2. Using Different Settings for Production 183

The Django Book, Release 2.0

We’ll take these one at a time.

First, the most basic approach is to deﬁne two separate settings ﬁles. If you’re following along, you’ve already got

settings.py. Now, just make a copy of it called settings_production.py. (We made this name up; you

can call it whatever you want.) In this new ﬁle, change DEBUG, etc.

The second approach is similar but cuts down on redundancy. Instead of having two settings ﬁles whose contents are

mostly similar, you can treat one as the “base” ﬁle and create another ﬁle that imports from it. For example:

# settings.py

DEBUG =True

TEMPLATE_DEBUG =DEBUG

DATABASE_ENGINE =’postgresql_psycopg2’

DATABASE_NAME =’devdb’

DATABASE_USER =’’

DATABASE_PASSWORD =’’

DATABASE_PORT =’’

# ...

# settings_production.py

from settings import *

DEBUG =TEMPLATE_DEBUG =False

DATABASE_NAME =’production’

DATABASE_USER =’app’

DATABASE_PASSWORD =’letmein’

Here, settings_production.py imports everything from settings.py and just redeﬁnes the settings that

are particular to production. In this case, DEBUG is set to False, but we’ve also set different database access param-

eters for the production setting. (The latter goes to show that you can redeﬁne any setting, not just the basic ones like

DEBUG.)

Finally, the most concise way of accomplishing two settings environments is to use a single settings ﬁle that branches

based on the environment. One way to do this is to check the current hostname. For example:

# settings.py

import socket

if socket.gethostname() == ’my-laptop’:

DEBUG =TEMPLATE_DEBUG =True

else:

DEBUG =TEMPLATE_DEBUG =False

# ...

Here, we import the socket module from Python’s standard library and use it to check the current system’s hostname.

We can check the hostname to determine whether the code is being run on the production server.

A core lesson here is that settings ﬁles are just Python code. They can import from other ﬁles, they can execute

arbitrary logic, etc. Just make sure that, if you go down this road, the Python code in your settings ﬁles is bulletproof.

If it raises any exceptions, Django will likely crash badly.

Renaming settings.py

Feel free to rename your settings.py to settings_dev.py or settings/dev.py or foobar.py –

184 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Django doesn’t care, as long as you tell it what settings ﬁle you’re using.

But if you do rename the settings.py ﬁle that is generated by django-admin.py startproject, you’ll

ﬁnd that manage.py will give you an error message saying that it can’t ﬁnd the settings. That’s because it tries

to import a module called settings. You can ﬁx this either by editing manage.py to change settings to

the name of your module, or by using django-admin.py instead of manage.py. In the latter case, you’ll

need to set the DJANGO_SETTINGS_MODULE environment variable to the Python path to your settings ﬁle (e.g.,

’mysite.settings’).

14.3 DJANGO_SETTINGS_MODULE

With those code changes out of the way, the next part of this chapter will focus on deployment instructions for spe-

ciﬁc environments, such as Apache. The instructions are different for each environment, but one thing remains the

same: in each case, you will have to tell the Web server your DJANGO_SETTINGS_MODULE. This is the entry point

into your Django application. The DJANGO_SETTINGS_MODULE points to your settings ﬁle, which points to your

ROOT_URLCONF, which points to your views, and so on.

DJANGO_SETTINGS_MODULE is the Python path to your settings ﬁle. For example, assuming the mysite directory

is on your Python path, the DJANGO_SETTINGS_MODULE for our ongoing example is ’mysite.settings’.

14.4 Using Django with Apache and mod_python

Apache with mod_python historically has been the suggested setup for using Django on a production server.

mod_python (http://www.djangoproject.com/r/mod_python/) is an Apache plug-in that embeds Python within Apache

and loads Python code into memory when the server starts. Code stays in memory throughout the life of an Apache

process, which leads to signiﬁcant performance gains over other server arrangements.

Django requires Apache 2.x and mod_python 3.x.

Note: Conﬁguring Apache is well beyond the scope of this book, so we’ll simply mention details as needed. Luckily,

many great resources are available if you need to learn more about Apache. A few of them we like are:

• The free online Apache documentation, available via http://www.djangoproject.com/r/apache/docs/

•Pro Apache, Third Edition (Apress, 2004) by Peter Wainwright, available via

http://www.djangoproject.com/r/books/pro-apache/

•Apache: The Deﬁnitive Guide, Third Edition (O’Reilly, 2002) by Ben Laurie and Peter Laurie, available via

http://www.djangoproject.com/r/books/pro-apache/

14.4.1 Basic Conﬁguration

To conﬁgure Django with mod_python, ﬁrst make sure you have Apache installed with the mod_python module acti-

vated. This usually means having a LoadModule directive in your Apache conﬁguration ﬁle. It will look something

like this:

LoadModule python_module /usr/lib/apache2/modules/mod_python.so

Then, edit your Apache conﬁguration ﬁle and add a <Location> directive that ties a speciﬁc URL path to a speciﬁc

Django installation. For example:

14.3. DJANGO_SETTINGS_MODULE 185

The Django Book, Release 2.0

SetHandler python-program

PythonHandler django.core.handlers.modpython

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

PythonDebug Off

</Location>

Make sure to replace mysite.settings with the appropriate DJANGO_SETTINGS_MODULE for your site.

This tells Apache, “Use mod_python for any URL at or under ‘/’, using the Django mod_python handler.” It passes

the value of DJANGO_SETTINGS_MODULE so mod_python knows which settings to use.

Note that we’re using the <Location> directive, not the <Directory> directive. The latter is used for pointing at

places on your ﬁlesystem, whereas <Location> points at places in the URL structure of a Web site. <Directory>

would be meaningless here.

Apache likely runs as a different user than your normal login and may have a different path and sys.path. You may

need to tell mod_python how to ﬁnd your project and Django itself.

PythonPath "[’/path/to/project’, ’/path/to/django’] + sys.path"

You can also add directives such as PythonAutoReload Off for performance. See the mod_python documenta-

tion for a full list of options.

Note that you should set PythonDebug Off on a production server. If you leave PythonDebug On, your users

will see ugly (and revealing) Python tracebacks if something goes wrong within mod_python.

Restart Apache, and any request to your site (or virtual host if you’ve put this directive inside a <VirtualHost>

block) will be served by Django.

14.4.2 Running Multiple Django Installations on the Same Apache Instance

It’s entirely possible to run multiple Django installations on the same Apache instance. You might want to do this if

you’re an independent Web developer with multiple clients but only a single server.

To accomplish this, just use VirtualHost like so:

NameVirtualHost *

ServerName www.example.com

# ...

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

</VirtualHost>

ServerName www2.example.com

# ...

SetEnv DJANGO_SETTINGS_MODULE mysite.other_settings

</VirtualHost>

If you need to put two Django installations within the same VirtualHost, you’ll need to take a special precaution to

ensure mod_python’s code cache doesn’t mess things up. Use the PythonInterpreter directive to give different

<Location> directives separate interpreters:

ServerName www.example.com

# ...

186 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

PythonInterpreter mysite

</Location>

SetEnv DJANGO_SETTINGS_MODULE mysite.other_settings

PythonInterpreter mysite_other

</Location>

</VirtualHost>

The values of PythonInterpreter don’t really matter, as long as they’re different between the two Location

blocks.

14.4.3 Running a Development Server with mod_python

Because mod_python caches loaded Python code, when deploying Django sites on mod_python you’ll need to restart

Apache each time you make changes to your code. This can be a hassle, so here’s a quick trick to avoid it: just add

MaxRequestsPerChild 1 to your conﬁg ﬁle to force Apache to reload everything for each request. But don’t do

that on a production server, or we’ll revoke your Django privileges.

If you’re the type of programmer who debugs using scattered print statements (we are), note that print statements

have no effect in mod_python; they don’t appear in the Apache log, as you might expect. If you have the need to print

debugging information in a mod_python setup, you’ll probably want to use Python’s standard logging package. More

information is available at http://docs.python.org/lib/module-logging.html.

14.4.4 Serving Django and Media Files from the Same Apache Instance

Django should not be used to serve media ﬁles itself; leave that job to whichever Web server you choose. We recom-

mend using a separate Web server (i.e., one that’s not also running Django) for serving media. For more information,

see the “Scaling” section.

If, however, you have no option but to serve media ﬁles on the same Apache VirtualHost as Django, here’s how

you can turn off mod_python for a particular part of the site:

SetHandler None

</Location>

Change Location to the root URL of your media ﬁles.

You can also use <LocationMatch> to match a regular expression. For example, this sets up Django at the site

root but explicitly disables Django for the media subdirectory and any URL that ends with .jpg,.gif, or .png:

SetHandler python-program

PythonHandler django.core.handlers.modpython

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

</Location>

SetHandler None

</Location>

SetHandler None

</LocationMatch>

14.4. Using Django with Apache and mod_python 187

The Django Book, Release 2.0

In all of these cases, you’ll need to set the DocumentRoot directive so Apache knows where to ﬁnd your static ﬁles.

14.4.5 Error Handling

When you use Apache/mod_python, errors will be caught by Django – in other words, they won’t propagate to the

Apache level and won’t appear in the Apache error_log.

The exception to this is if something is really messed up in your Django setup. In that case, you’ll see an ominous

“Internal Server Error” page in your browser and the full Python traceback in your Apache error_log ﬁle. The

error_log traceback is spread over multiple lines. (Yes, this is ugly and rather hard to read, but it’s how mod_python

does things.)

14.4.6 Handling a Segmentation Fault

Sometimes, Apache segfaults when you install Django. When this happens, it’s almost always one of two causes

mostly unrelated to Django itself:

• It may be that your Python code is importing the pyexpat module (used for XML parsing), which may

conﬂict with the version embedded in Apache. For full information, see “Expat Causing Apache Crash” at

http://www.djangoproject.com/r/articles/expat-apache-crash/.

• It may be because you’re running mod_python and mod_php in the same Apache instance, with MySQL as

your database backend. In some cases, this causes a known mod_python issue due to version conﬂicts in

PHP and the Python MySQL back-end. There’s full information in a mod_python FAQ entry, accessible via

http://www.djangoproject.com/r/articles/php-modpython-faq/.

If you continue to have problems setting up mod_python, a good thing to do is get a bare-bones mod_python site

working, without the Django framework. This is an easy way to isolate mod_python-speciﬁc problems. The article

“Getting mod_python Working” details this procedure: http://www.djangoproject.com/r/articles/getting-modpython-

working/.

The next step should be to edit your test code and add an import of any Django-speciﬁc code you’re using – your views,

your models, your URLconf, your RSS conﬁguration, and so forth. Put these imports in your test handler function

and access your test URL in a browser. If this causes a crash, you’ve conﬁrmed it’s the importing of Django code that

causes the problem. Gradually reduce the set of imports until it stops crashing, so as to ﬁnd the speciﬁc module that

causes the problem. Drop down further into modules and look into their imports as necessary. For more help, system

tools like ldconfig on Linux, otool on Mac OS, and ListDLLs (from SysInternals) on Windows can help you

identify shared dependencies and possible version conﬂicts.

14.4.7 An Alternative: mod_wsgi

As an alternative to mod_python, you might consider using mod_wsgi (http://code.google.com/p/modwsgi/), which

has been developed more recently than mod_python and is getting some traction in the Django community. A full

overview is outside the scope of this book, but see the ofﬁcial Django documentation for more information.

14.5 Using Django with FastCGI

Although Django under Apache and mod_python is the most robust deployment setup, many people use shared host-

ing, on which FastCGI is the only available deployment option.

Additionally, in some situations, FastCGI allows better security and possibly better performance than mod_python.

For small sites, FastCGI can also be more lightweight than Apache.

188 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

14.5.1 FastCGI Overview

FastCGI is an efﬁcient way of letting an external application serve pages to a Web server. The Web server delegates

the incoming Web requests (via a socket) to FastCGI, which executes the code and passes the response back to the

Web server, which, in turn, passes it back to the client’s Web browser.

Like mod_python, FastCGI allows code to stay in memory, allowing requests to be served with no startup time. Unlike

mod_python, a FastCGI process doesn’t run inside the Web server process, but in a separate, persistent process.

Why Run Code in a Separate Process?

The traditional mod_*arrangements in Apache embed various scripting languages (most notably PHP,

Python/mod_python, and Perl/mod_perl) inside the process space of your Web server. Although this lowers startup

time (because code doesn’t have to be read off disk for every request), it comes at the cost of memory use.

Each Apache process gets a copy of the Apache engine, complete with all the features of Apache that Django simply

doesn’t take advantage of. FastCGI processes, on the other hand, only have the memory overhead of Python and

Django.

Due to the nature of FastCGI, it’s also possible to have processes that run under a different user account than the Web

server process. That’s a nice security beneﬁt on shared systems, because it means you can secure your code from other

users.

Before you can start using FastCGI with Django, you’ll need to install flup, a Python library for dealing with

FastCGI. Some users have reported stalled pages with older flup versions, so you may want to use the latest SVN

version. Get flup at http://www.djangoproject.com/r/ﬂup/.

14.5.2 Running Your FastCGI Server

FastCGI operates on a client/server model, and in most cases you’ll be starting the FastCGI server process on your

own. Your Web server (be it Apache, lighttpd, or otherwise) contacts your Django-FastCGI process only when the

server needs a dynamic page to be loaded. Because the daemon is already running with the code in memory, it’s able

to serve the response very quickly.

Note

If you’re on a shared hosting system, you’ll probably be forced to use Web server-managed FastCGI processes.

If you’re in this situation, you should read the section titled “Running Django on a Shared-Hosting Provider with

Apache,” below.

A Web server can connect to a FastCGI server in one of two ways: it can use either a Unix domain socket (a named

pipe on Win32 systems) or a TCP socket. What you choose is a manner of preference; a TCP socket is usually easier

due to permissions issues.

To start your server, ﬁrst change into the directory of your project (wherever your manage.py is), and then run

manage.py with the runfcgi command:

./manage.py runfcgi [options]

If you specify help as the only option after runfcgi, a list of all the available options will display.

You’ll need to specify either a socket or both host and port. Then, when you set up your Web server, you’ll just

need to point it at the socket or host/port you speciﬁed when starting the FastCGI server.

A few examples should help explain this:

• Running a threaded server on a TCP port:

14.5. Using Django with FastCGI 189

The Django Book, Release 2.0

./manage.py runfcgi method=threaded host=127.0.0.1 port=3033

• Running a preforked server on a Unix domain socket:

./manage.py runfcgi method=prefork socket=/home/user/mysite.sock pidfile=django.pid

• Run without daemonizing (backgrounding) the process (good for debugging):

./manage.py runfcgi daemonize=false socket=/tmp/mysite.sock

Stopping the FastCGI Daemon

If you have the process running in the foreground, it’s easy enough to stop it: simply press Ctrl+C to stop and quit the

FastCGI server. However, when you’re dealing with background processes, you’ll need to resort to the Unix kill

command.

If you specify the pidfile option to your manage.py runfcgi, you can kill the running FastCGI daemon like

this:

kill ‘cat $PIDFILE‘

where $PIDFILE is the pidfile you speciﬁed.

To easily restart your FastCGI daemon on Unix, you can use this small shell script:

#!/bin/bash

# Replace these three settings.

PROJDIR="/home/user/myproject"

PIDFILE="$PROJDIR/mysite.pid"

SOCKET="$PROJDIR/mysite.sock"

cd $PROJDIR

if [ -f $PIDFILE ]; then

kill ‘cat -- $PIDFILE‘

rm -f -- $PIDFILE

exec /usr/bin/env - \

PYTHONPATH="../python:.." \

./manage.py runfcgi socket=$SOCKET pidfile=$PIDFILE

14.5.3 Using Django with Apache and FastCGI

To use Django with Apache and FastCGI, you’ll need Apache installed and conﬁgured, with

mod_fastcgi installed and enabled. Consult the Apache and mod_fastcgi documentation for instructions:

http://www.djangoproject.com/r/mod_fastcgi/.

Once you’ve completed the setup, point Apache at your Django FastCGI instance by editing the httpd.conf

(Apache conﬁguration) ﬁle. You’ll need to do two things:

• Use the FastCGIExternalServer directive to specify the location of your FastCGI server.

• Use mod_rewrite to point URLs at FastCGI as appropriate.

190 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Specifying the Location of the FastCGI Server

The FastCGIExternalServer directive tells Apache how to ﬁnd your FastCGI server. As the FastCGIExter-

nalServer docs (http://www.djangoproject.com/r/mod_fastcgi/FastCGIExternalServer/) explain, you can specify either

asocket or a host. Here are examples of both:

# Connect to FastCGI via a socket/named pipe:

FastCGIExternalServer /home/user/public_html/mysite.fcgi -socket /home/user/mysite.sock

# Connect to FastCGI via a TCP host/port:

FastCGIExternalServer /home/user/public_html/mysite.fcgi -host 127.0.0.1:3033

In either case, the the directory /home/user/public_html/ should exist, though the ﬁle

/home/user/public_html/mysite.fcgi doesn’t actually have to exist. It’s just a URL used by the

Web server internally – a hook for signifying which requests at a URL should be handled by FastCGI. (More on this

in the next section.)

Using mod_rewrite to Point URLs at FastCGI

The second step is telling Apache to use FastCGI for URLs that match a certain pattern. To do this, use the mod_rewrite

module and rewrite URLs to mysite.fcgi (or whatever you speciﬁed in the FastCGIExternalServer direc-

tive, as explained in the previous section).

In this example, we tell Apache to use FastCGI to handle any request that doesn’t represent a ﬁle on the ﬁlesystem and

doesn’t start with /media/. This is probably the most common case, if you’re using Django’s admin site:

ServerName example.com

DocumentRoot /home/user/public_html

Alias /media /home/user/python/django/contrib/admin/media

RewriteEngine On

RewriteRule ^/(media.*)$ /$1 [QSA,L]

RewriteCond %{REQUEST_FILENAME} !-f

RewriteRule ^/(.*)$ /mysite.fcgi/$1 [QSA,L]

</VirtualHost>

14.5.4 FastCGI and lighttpd

lighttpd (http://www.djangoproject.com/r/lighttpd/) is a lightweight Web server commonly used for serving static ﬁles.

It supports FastCGI natively and thus is also an ideal choice for serving both static and dynamic pages, if your site

doesn’t have any Apache-speciﬁc needs.

Make sure mod_fastcgi is in your modules list, somewhere after mod_rewrite and mod_access, but not after

mod_accesslog. You’ll probably want mod_alias as well, for serving admin media.

Add the following to your lighttpd conﬁg ﬁle:

server.document-root = "/home/user/public_html"

fastcgi.server = (

"/mysite.fcgi" => (

"main" => (

# Use host / port instead of socket for TCP fastcgi

# "host" => "127.0.0.1",

# "port" => 3033,

"socket" => "/home/user/mysite.sock",

"check-local" => "disable",

)

14.5. Using Django with FastCGI 191

The Django Book, Release 2.0

)

alias.url = (

"/media/" => "/home/user/django/contrib/admin/media/",

)

url.rewrite-once = (

"^(/media.*)$" => "$1",

"^/favicon\.ico$" => "/media/favicon.ico",

"^(/.*)$" => "/mysite.fcgi$1",

)

Running Multiple Django Sites on One lighttpd Instance

lighttpd lets you use “conditional conﬁguration” to allow conﬁguration to be customized per host. To specify multiple

FastCGI sites, just add a conditional block around your FastCGI conﬁg for each site:

# If the hostname is ’www.example1.com’...

$HTTP["host"] == "www.example1.com" {

server.document-root = "/foo/site1"

fastcgi.server = (

...

)

...

}

# If the hostname is ’www.example2.com’...

$HTTP["host"] == "www.example2.com" {

server.document-root = "/foo/site2"

fastcgi.server = (

...

)

...

}

You can also run multiple Django installations on the same site simply by specifying multiple entries in the

fastcgi.server directive. Add one FastCGI host for each.

14.5.5 Running Django on a Shared-Hosting Provider with Apache

Many shared-hosting providers don’t allow you to run your own server daemons or edit the httpd.conf ﬁle. In

these cases, it’s still possible to run Django using Web server-spawned processes.

Note

If you’re using Web server-spawned processes, as explained in this section, there’s no need for you to start the FastCGI

server on your own. Apache will spawn a number of processes, scaling as it needs to.

In your Web root directory, add this to a ﬁle named .htaccess

AddHandler fastcgi-script .fcgi

RewriteEngine On

RewriteCond %{REQUEST_FILENAME} !-f

RewriteRule ^(.*)$ mysite.fcgi/$1 [QSA,L]

192 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Then, create a small script that tells Apache how to spawn your FastCGI program. Create a ﬁle, mysite.fcgi, and

place it in your Web directory, and be sure to make it executable

#!/usr/bin/python

import sys,os

# Add a custom Python path.

sys.path.insert(0,"/home/user/python")

# Switch to the directory of your project. (Optional.)

# os.chdir("/home/user/myproject")

# Set the DJANGO_SETTINGS_MODULE environment variable.

os.environ[’DJANGO_SETTINGS_MODULE’]="myproject.settings"

from django.core.servers.fastcgi import runfastcgi

runfastcgi(method="threaded", daemonize="false")

Restarting the Spawned Server

If you change any Python code on your site, you’ll need to tell FastCGI the code has changed. But there’s no need to

restart Apache in this case. Rather, just reupload mysite.fcgi – or edit the ﬁle – so that the timestamp on the ﬁle

changes. When Apache sees the ﬁle has been updated, it will restart your Django application for you.

If you have access to a command shell on a Unix system, you can accomplish this easily by using the touch command:

touch mysite.fcgi

14.6 Scaling

Now that you know how to get Django running on a single server, let’s look at how you can scale out a Django

installation. This section walks through how a site might scale from a single server to a large-scale cluster that could

serve millions of hits an hour.

It’s important to note, however, that nearly every large site is large in different ways, so scaling is anything but a one-

size-ﬁts-all operation. The following coverage should sufﬁce to show the general principle, and whenever possible

we’ll try to point out where different choices could be made.

First off, we’ll make a pretty big assumption and exclusively talk about scaling under Apache and mod_python.

Though we know of a number of successful medium- to large-scale FastCGI deployments, we’re much more familiar

with Apache.

14.6.1 Running on a Single Server

Most sites start out running on a single server, with an architecture that looks something like Figure 12-1.

This works just ﬁne for small- to medium-sized sites, and it’s relatively cheap – you can put together a single-server

site designed for Django for well under $3,000.

However, as trafﬁc increases you’ll quickly run into resource contention between the different pieces of software.

Database servers and Web servers love to have the entire server to themselves, so when run on the same server they

often end up “ﬁghting” over the same resources (RAM, CPU) that they’d prefer to monopolize.

This is solved easily by moving the database server to a second machine, as explained in the following section.

14.6. Scaling 193

The Django Book, Release 2.0

Figure 14.1: Figure 12-1: a single server Django setup.

14.6.2 Separating Out the Database Server

As far as Django is concerned, the process of separating out the database server is extremely easy: you’ll simply need

to change the DATABASE_HOST setting to the IP or DNS name of your database server. It’s probably a good idea to

use the IP if at all possible, as relying on DNS for the connection between your Web server and database server isn’t

recommended.

With a separate database server, our architecture now looks like Figure 12-2.

Here we’re starting to move into what’s usually called n-tier architecture. Don’t be scared by the buzzword – it just

refers to the fact that different “tiers” of the Web stack get separated out onto different physical machines.

At this point, if you anticipate ever needing to grow beyond a single database server, it’s probably a good idea to start

thinking about connection pooling and/or database replication. Unfortunately, there’s not nearly enough space to do

those topics justice in this book, so you’ll need to consult your database’s documentation and/or community for more

information.

14.6.3 Running a Separate Media Server

We still have a big problem left over from the single-server setup: the serving of media from the same box that handles

dynamic content.

Those two activities perform best under different circumstances, and by smashing them together on the same box you

end up with neither performing particularly well. So the next step is to separate out the media – that is, anything not

generated by a Django view – onto a dedicated server (see Figure 12-3).

Ideally, this media server should run a stripped-down Web server optimized for static media delivery. lighttpd and

tux (http://www.djangoproject.com/r/tux/) are both excellent choices here, but a heavily stripped down Apache could

work, too.

For sites heavy in static content (photos, videos, etc.), moving to a separate media server is doubly important and

should likely be the ﬁrst step in scaling up.

194 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Figure 14.2: Figure 12-2: Moving the database onto a dedicated server.

Figure 14.3: Figure 12-3: Separating out the media server.

14.6. Scaling 195

The Django Book, Release 2.0

This step can be slightly tricky, however. If your application involves ﬁle uploads, Django needs to be able to write

uploaded media to the media server. If media lives on another server, you’ll need to arrange a way for that write to

happen across the network.

14.6.4 Implementing Load Balancing and Redundancy

At this point, we’ve broken things down as much as possible. This three-server setup should handle a very large

amount of trafﬁc – we served around 10 million hits a day from an architecture of this sort – so if you grow further,

you’ll need to start adding redundancy.

This is a good thing, actually. One glance at Figure 12-3 shows you that if even a single one of your three servers fails,

you’ll bring down your entire site. So as you add redundant servers, not only do you increase capacity, but you also

increase reliability.

For the sake of this example, let’s assume that the Web server hits capacity ﬁrst. It’s relatively easy to get multiple

copies of a Django site running on different hardware – just copy all the code onto multiple machines, and start Apache

on both of them.

However, you’ll need another piece of software to distribute trafﬁc over your multiple servers: a load balancer. You

can buy expensive and proprietary hardware load balancers, but there are a few high-quality open source software load

balancers out there.

Apache’s mod_proxy is one option, but we’ve found Perlbal (http://www.djangoproject.com/r/perlbal/) to be fantas-

tic. It’s a load balancer and reverse proxy written by the same folks who wrote memcached (see Chapter 15).

Note: If you’re using FastCGI, you can accomplish this same distribution/load balancing step by separating your

front-end Web servers and back-end FastCGI processes onto different machines. The front-end server essentially

becomes the load balancer, and the back-end FastCGI processes replace the Apache/mod_python/Django servers.

With the Web servers now clustered, our evolving architecture starts to look more complex, as shown in Figure 12-4.

Notice that in the diagram the Web servers are referred to as a “cluster” to indicate that the number of servers is

basically variable. Once you have a load balancer out front, you can easily add and remove back-end Web servers

without a second of downtime.

14.6.5 Going Big

At this point, the next few steps are pretty much derivatives of the last one:

• As you need more database performance, you might want to add replicated database servers. MySQL includes

built-in replication; PostgreSQL users should look into Slony (http://www.djangoproject.com/r/slony/) and pg-

pool (http://www.djangoproject.com/r/pgpool/) for replication and connection pooling, respectively.

• If the single load balancer isn’t enough, you can add more load balancer machines out front and distribute among

them using round-robin DNS.

• If a single media server doesn’t sufﬁce, you can add more media servers and distribute the load with your

load-balancing cluster.

• If you need more cache storage, you can add dedicated cache servers.

• At any stage, if a cluster isn’t performing well, you can add more servers to the cluster.

After a few of these iterations, a large-scale architecture might look like Figure 12-5.

Though we’ve shown only two or three servers at each level, there’s no fundamental limit to how many you can add.

196 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Figure 14.4: Figure 12-4: A load-balanced, redundant server setup.

14.6. Scaling 197

The Django Book, Release 2.0

Figure 14.5: Figure 12-5. An example large-scale Django setup.

14.7 Performance Tuning

If you have huge amount of money, you can just keep throwing hardware at scaling problems. For the rest of us,

though, performance tuning is a must.

Note: Incidentally, if anyone with monstrous gobs of cash is actually reading this book, please consider a substantial

donation to the Django Foundation. We accept uncut diamonds and gold ingots, too.

Unfortunately, performance tuning is much more of an art than a science, and it is even more difﬁcult to write about

than scaling. If you’re serious about deploying a large-scale Django application, you should spend a great deal of time

learning how to tune each piece of your stack.

The following sections, though, present a few Django-speciﬁc tuning tips we’ve discovered over the years.

14.7.1 There’s No Such Thing As Too Much RAM

Even the really expensive RAM is relatively affordable these days. Buy as much RAM as you can possibly afford, and

then buy a little bit more.

Faster processors won’t improve performance all that much; most Web servers spend up to 90% of their time waiting

on disk I/O. As soon as you start swapping, performance will just die. Faster disks might help slightly, but they’re

much more expensive than RAM, such that it doesn’t really matter.

If you have multiple servers, the ﬁrst place to put your RAM is in the database server. If you can afford it, get enough

RAM to get ﬁt your entire database into memory. This shouldn’t be too hard; we’ve developed a site with more than

half a million newspaper articles, and it took under 2GB of space.

198 Chapter 14. Chapter 12: Deploying Django

The Django Book, Release 2.0

Next, max out the RAM on your Web server. The ideal situation is one where neither server swaps – ever. If you get

to that point, you should be able to withstand most normal trafﬁc.

14.7.2 Turn Off Keep-Alive

Keep-Alive is a feature of HTTP that allows multiple HTTP requests to be served over a single TCP connection,

avoiding the TCP setup/teardown overhead.

This looks good at ﬁrst glance, but it can kill the performance of a Django site. If you’re properly serving media from

a separate server, each user browsing your site will only request a page from your Django server every ten seconds or

so. This leaves HTTP servers waiting around for the next keep-alive request, and an idle HTTP server just consumes

RAM that an active one should be using.

14.7.3 Use memcached

Although Django supports a number of different cache back-ends, none of them even come close to being as fast as

memcached. If you have a high-trafﬁc site, don’t even bother with the other backends – go straight to memcached.

14.7.4 Use memcached Often

Of course, selecting memcached does you no good if you don’t actually use it. Chapter 15 is your best friend here:

learn how to use Django’s cache framework, and use it everywhere possible. Aggressive, preemptive caching is usually

the only thing that will keep a site up under major trafﬁc.

14.7.5 Join the Conversation

Each piece of the Django stack – from Linux to Apache to PostgreSQL or MySQL – has an awesome community

behind it. If you really want to get that last 1% out of your servers, join the open source communities behind your

software and ask for help. Most free-software community members will be happy to help.

And also be sure to join the Django community. Your humble authors are only two members of an incredibly active,

growing group of Django developers. Our community has a huge amount of collective experience to offer.

14.8 What’s Next?

The remaining chapters focus on other Django features that you may or may not need, depending on your application.

Feel free to read them in any order you choose.

14.8. What’s Next? 199

The Django Book, Release 2.0

200 Chapter 14. Chapter 12: Deploying Django

CHAPTER 15

Chapter 13: Generating Non-HTML Content

Usually when we talk about developing Web sites, we’re talking about producing HTML. Of course, there’s a lot more

to the Web than HTML; we use the Web to distribute data in all sorts of formats: RSS, PDFs, images, and so forth.

So far, we’ve focused on the common case of HTML production, but in this chapter we’ll take a detour and look at

using Django to produce other types of content.

Django has convenient built-in tools that you can use to produce some common non-HTML content:

• RSS/Atom syndication feeds

• Sitemaps (an XML format originally developed by Google that gives hints to search engines)

We’ll examine each of those tools a little later, but ﬁrst we’ll cover the basic principles.

15.1 The basics: views and MIME-types

Recall from Chapter 3 that a view function is simply a Python function that takes a Web request and returns a Web

response. This response can be the HTML contents of a Web page, or a redirect, or a 404 error, or an XML document,

or an image...or anything, really.

More formally, a Django view function must

• Accept an HttpRequest instance as its ﬁrst argument

• Return an HttpResponse instance

The key to returning non-HTML content from a view lies in the HttpResponse class, speciﬁcally the mimetype

argument. By tweaking the MIME type, we can indicate to the browser that we’ve returned a response of a different

format.

For example, let’s look at a view that returns a PNG image. To keep things simple, we’ll just read the ﬁle off the disk:

from django.http import HttpResponse

def my_image(request):

image_data =open("/path/to/my/image.png","rb").read()

return HttpResponse(image_data, mimetype="image/png")

That’s it! If you replace the image path in the open() call with a path to a real image, you can use this very simple

view to serve an image, and the browser will display it correctly.

The other important thing to keep in mind is that HttpResponse objects implement Python’s standard “ﬁle-like

object” API. This means that you can use an HttpResponse instance in any place Python (or a third-party library)

expects a ﬁle.

201

The Django Book, Release 2.0

For an example of how that works, let’s take a look at producing CSV with Django.

15.2 Producing CSV

CSV is a simple data format usually used by spreadsheet software. It’s basically a series of table rows, with each

cell in the row separated by a comma (CSV stands for comma-separated values). For example, here’s some data on

“unruly” airline passengers in CSV format:

Year,Unruly Airline Passengers

1995,146

1996,184

1997,235

1998,200

1999,226

2000,251

2001,299

2002,273

2003,281

2004,304

2005,203

2006,134

2007,147

Note: The preceding listing contains real numbers! They come from the U.S. Federal Aviation Administration.

Though CSV looks simple, its formatting details haven’t been universally agreed upon. Different pieces of software

produce and consume different variants of CSV, making it a bit tricky to use. Luckily, Python comes with a standard

CSV library, csv, that is pretty much bulletproof.

Because the csv module operates on ﬁle-like objects, it’s a snap to use an HttpResponse instead:

import csv

from django.http import HttpResponse

# Number of unruly passengers each year 1995 - 2005. In a real application

# this would likely come from a database or some other back-end data store.

UNRULY_PASSENGERS =[146,184,235,200,226,251,299,273,281,304,203]

def unruly_passengers_csv(request):

# Create the HttpResponse object with the appropriate CSV header.

response =HttpResponse(mimetype=’text/csv’)

response[’Content-Disposition’]=’attachment; filename=unruly.csv’

# Create the CSV writer using the HttpResponse as the "file."

writer =csv.writer(response)

writer.writerow([’Year’,’Unruly Airline Passengers’])

for (year, num) in zip(range(1995,2006), UNRULY_PASSENGERS):

writer.writerow([year, num])

return response

The code and comments should be pretty clear, but a few things deserve special mention:

• The response is given the text/csv MIME type (instead of the default text/html). This tells browsers that

the document is a CSV ﬁle.

202 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

• The response gets an additional Content-Disposition header, which contains the name of the CSV ﬁle.

This header (well, the “attachment” part) will instruct the browser to prompt for a location to save the ﬁle instead

of just displaying it. This ﬁle name is arbitrary; call it whatever you want. It will be used by browsers in the

“Save As” dialog.

To assign a header on an HttpResponse, just treat the HttpResponse as a dictionary and set a key/value.

• Hooking into the CSV-generation API is easy: just pass response as the ﬁrst argument to csv.writer.

The csv.writer function expects a ﬁle-like object, and HttpResponse objects ﬁt the bill.

• For each row in your CSV ﬁle, call writer.writerow, passing it an iterable object such as a list or tuple.

• The CSV module takes care of quoting for you, so you don’t have to worry about escaping strings with quotes

or commas in them. Just pass information to writerow(), and it will do the right thing.

This is the general pattern you’ll use any time you need to return non-HTML content: create an HttpResponse

response object (with a special MIME type), pass it to something expecting a ﬁle, and then return the response.

Let’s look at a few more examples.

15.3 Generating PDFs

Portable Document Format (PDF) is a format developed by Adobe that’s used to represent printable documents, com-

plete with pixel-perfect formatting, embedded fonts, and 2D vector graphics. You can think of a PDF document as

the digital equivalent of a printed document; indeed, PDFs are often used in distributing documents for the purpose of

printing them.

You can easily generate PDFs with Python and Django thanks to the excellent open source ReportLab library

(http://www.reportlab.org/rl_toolkit.html). The advantage of generating PDF ﬁles dynamically is that you can cre-

ate customized PDFs for different purposes – say, for different users or different pieces of content.

For example, your humble authors used Django and ReportLab at KUSports.com to generate customized, printer-ready

NCAA tournament brackets.

15.3.1 Installing ReportLab

Before you do any PDF generation, however, you’ll need to install ReportLab. It’s usually simple: just download and

install the library from http://www.reportlab.org/downloads.html.

Note: If you’re using a modern Linux distribution, you might want to check your package management utility before

installing ReportLab. Most package repositories have added ReportLab.

For example, if you’re using Ubuntu, a simple apt-get install python-reportlab will do the trick nicely.

The user guide (naturally available only as a PDF ﬁle) at http://www.reportlab.org/rsrc/userguide.pdf has additional

installation instructions.

Test your installation by importing it in the Python interactive interpreter:

>>> import reportlab

If that command doesn’t raise any errors, the installation worked.

15.3.2 Writing Your View

Like CSV, generating PDFs dynamically with Django is easy because the ReportLab API acts on ﬁle-like objects.

15.3. Generating PDFs 203

The Django Book, Release 2.0

Here’s a “Hello World” example:

from reportlab.pdfgen import canvas

from django.http import HttpResponse

def hello_pdf(request):

# Create the HttpResponse object with the appropriate PDF headers.

response =HttpResponse(mimetype=’application/pdf’)

response[’Content-Disposition’]=’attachment; filename=hello.pdf’

# Create the PDF object, using the response object as its "file."

p=canvas.Canvas(response)

# Draw things on the PDF. Here’s where the PDF generation happens.

# See the ReportLab documentation for the full list of functionality.

p.drawString(100,100,"Hello world.")

# Close the PDF object cleanly, and we’re done.

p.showPage()

p.save()

return response

A few notes are in order:

• Here we use the application/pdf MIME type. This tells browsers that the document is a PDF ﬁle, rather

than an HTML ﬁle. If you leave off this information, browsers will probably interpret the response as HTML,

which will result in scary gobbledygook in the browser window.

• Hooking into the ReportLab API is easy: just pass response as the ﬁrst argument to canvas.Canvas. The

Canvas class expects a ﬁle-like object, and HttpResponse objects ﬁt the bill.

• All subsequent PDF-generation methods are called on the PDF object (in this case, p), not on response.

• Finally, it’s important to call showPage() and save() on the PDF ﬁle – or else, you’ll end up with a

corrupted PDF ﬁle.

15.3.3 Complex PDFs

If you’re creating a complex PDF document (or any large data blob), consider using the cStringIO library as

a temporary holding place for your PDF ﬁle. The cStringIO library provides a ﬁle-like object interface that is

written in C for maximum efﬁciency.

Here’s the previous “Hello World” example rewritten to use cStringIO:

from cStringIO import StringIO

from reportlab.pdfgen import canvas

from django.http import HttpResponse

def hello_pdf(request):

# Create the HttpResponse object with the appropriate PDF headers.

response =HttpResponse(mimetype=’application/pdf’)

response[’Content-Disposition’]=’attachment; filename=hello.pdf’

temp =StringIO()

# Create the PDF object, using the StringIO object as its "file."

p=canvas.Canvas(temp)

# Draw things on the PDF. Here’s where the PDF generation happens.

204 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

# See the ReportLab documentation for the full list of functionality.

p.drawString(100,100,"Hello world.")

# Close the PDF object cleanly.

p.showPage()

p.save()

# Get the value of the StringIO buffer and write it to the response.

response.write(temp.getvalue())

return response

15.4 Other Possibilities

There’s a whole host of other types of content you can generate in Python. Here are a few more ideas and some

pointers to libraries you could use to implement them:

•ZIP ﬁles: Python’s standard library ships with the zipfile module, which can both read and write compressed

ZIP ﬁles. You could use it to provide on-demand archives of a bunch of ﬁles, or perhaps compress large

documents when requested. You could similarly produce TAR ﬁles using the standard library’s tarfile

module.

•Dynamic images: The Python Imaging Library (PIL; http://www.pythonware.com/products/pil/) is a fantastic

toolkit for producing images (PNG, JPEG, GIF, and a whole lot more). You could use it to automatically scale

down images into thumbnails, composite multiple images into a single frame, or even do Web-based image

processing.

•Plots and charts: There are a number of powerful Python plotting and charting libraries you could use to produce

on-demand maps, charts, plots, and graphs. We can’t possibly list them all, so here are a couple of the highlights:

•matplotlib (http://matplotlib.sourceforge.net/) can be used to produce the type of high-quality plots usually

generated with MatLab or Mathematica.

•pygraphviz (http://networkx.lanl.gov/pygraphviz/), an interface to the Graphviz graph layout toolkit

(http://graphviz.org/), can be used for generating structured diagrams of graphs and networks.

In general, any Python library capable of writing to a ﬁle can be hooked into Django. The possibilities are immense.

Now that we’ve looked at the basics of generating non-HTML content, let’s step up a level of abstraction. Django

ships with some pretty nifty built-in tools for generating some common types of non-HTML content.

15.5 The Syndication Feed Framework

Django comes with a high-level syndication-feed-generating framework that makes creating RSS and Atom feeds

easy.

What’s RSS? What’s Atom?

RSS and Atom are both XML-based formats you can use to provide automatically updating “feeds” of

your site’s content. Read more about RSS at http://www.whatisrss.com/, and get information on Atom at

http://www.atomenabled.org/.

To create any syndication feed, all you have to do is write a short Python class. You can create as many feeds as you

want.

15.4. Other Possibilities 205

The Django Book, Release 2.0

The high-level feed-generating framework is a view that’s hooked to /feeds/ by convention. Django uses the

remainder of the URL (everything after /feeds/) to determine which feed to return.

To create a feed, you’ll write a Feed class and point to it in your URLconf.

15.5.1 Initialization

To activate syndication feeds on your Django site, add this URLconf:

(r’^feeds/(?P<url>.*)/$’,’django.contrib.syndication.views.feed’,

{’feed_dict’: feeds}

This line tells Django to use the RSS framework to handle all URLs starting with "feeds/". (You can change that

"feeds/" preﬁx to ﬁt your own needs.)

This URLconf line has an extra argument: {’feed_dict’: feeds}. Use this extra argument to pass the syndi-

cation framework the feeds that should be published under that URL.

Speciﬁcally, feed_dict should be a dictionary that maps a feed’s slug (short URL label) to its Feed class. You can

deﬁne the feed_dict in the URLconf itself. Here’s a full example URLconf:

from django.conf.urls.defaults import *

from mysite.feeds import LatestEntries, LatestEntriesByCategory

feeds ={

’latest’: LatestEntries,

’categories’: LatestEntriesByCategory,

}

urlpatterns =patterns(’’,

# ...

(r’^feeds/(?P<url>.*)/$’,’django.contrib.syndication.views.feed’,

{’feed_dict’: feeds}),

# ...

)

The preceding example registers two feeds:

• The feed represented by LatestEntries will live at feeds/latest/.

• The feed represented by LatestEntriesByCategory will live at feeds/categories/.

Once that’s set up, you’ll need to deﬁne the Feed classes themselves.

AFeed class is a simple Python class that represents a syndication feed. A feed can be simple (e.g., a “site news”

feed, or a basic feed displaying the latest entries of a blog) or more complex (e.g., a feed displaying all the blog entries

in a particular category, where the category is variable).

Feed classes must subclass django.contrib.syndication.feeds.Feed. They can live anywhere in your

code tree.

15.5.2 A Simple Feed

This simple example describes a feed of the latest ﬁve blog entries for a given blog:

from django.contrib.syndication.feeds import Feed

from mysite.blog.models import Entry

206 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

class LatestEntries(Feed):

title ="My Blog"

link ="/archive/"

description ="The latest news about stuff."

def items(self):

return Entry.objects.order_by(’-pub_date’)[:5]

The important things to notice here are as follows:

• The class subclasses django.contrib.syndication.feeds.Feed.

•title,link, and description correspond to the standard RSS <title>,<link>, and

<description> elements, respectively.

•items() is simply a method that returns a list of objects that should be included in the feed as <item>

elements. Although this example returns Entry objects using Django’s database API, items() doesn’t have

to return model instances.

There’s just one more step. In an RSS feed, each <item> has a <title>,<link>, and <description>. We

need to tell the framework what data to put into those elements.

• To specify the contents of <title> and <description>, create Django templates called

feeds/latest_title.html and feeds/latest_description.html, where latest is the

slug speciﬁed in the URLconf for the given feed. Note that the .html extension is required.

The RSS system renders that template for each item, passing it two template context variables:

–obj: The current object (one of whichever objects you returned in items()).

–site: A django.models.core.sites.Site object representing the current site. This is useful

for {{ site.domain }} or {{ site.name }}.

If you don’t create a template for either the title or description, the framework will use the template "{{ obj

}}" by default – that is, the normal string representation of the object. (For model objects, this will be the

__unicode__() method.

You can also change the names of these two templates by specifying title_template and

description_template as attributes of your Feed class.

• To specify the contents of <link>, you have two options. For each item in items(), Django ﬁrst tries

executing a get_absolute_url() method on that object. If that method doesn’t exist, it tries calling a

method item_link() in the Feed class, passing it a single parameter, item, which is the object itself.

Both get_absolute_url() and item_link() should return the item’s URL as a normal Python string.

• For the previous LatestEntries example, we could have very simple feed templates.

latest_title.html contains:

and latest_description.html contains:

It’s almost too easy...

15.5.3 A More Complex Feed

The framework also supports more complex feeds, via parameters.

15.5. The Syndication Feed Framework 207

The Django Book, Release 2.0

For example, say your blog offers an RSS feed for every distinct “tag” you’ve used to categorize your entries. It would

be silly to create a separate Feed class for each tag; that would violate the Don’t Repeat Yourself (DRY) principle

and would couple data to programming logic.

Instead, the syndication framework lets you make generic feeds that return items based on information in the feed’s

URL.

Your tag-speciﬁc feeds could use URLs like this:

•http://example.com/feeds/tags/python/: Returns recent entries tagged with “python”

•http://example.com/feeds/tags/cats/: Returns recent entries tagged with “cats”

The slug here is "tags". The syndication framework sees the extra URL bits after the slug – ’python’ and

’cats’ – and gives you a hook to tell it what those URL bits mean and how they should inﬂuence which items get

published in the feed.

An example makes this clear. Here’s the code for these tag-speciﬁc feeds:

from django.core.exceptions import ObjectDoesNotExist

from mysite.blog.models import Entry, Tag

class TagFeed(Feed):

def get_object(self, bits):

# In case of "/feeds/tags/cats/dogs/mice/", or other such

# clutter, check that bits has only one member.

if len(bits) != 1:

raise ObjectDoesNotExist

return Tag.objects.get(tag=bits[0])

def title(self, obj):

return "My Blog: Entries tagged with %s"%obj.tag

def link(self, obj):

return obj.get_absolute_url()

def description(self, obj):

return "Entries tagged with %s"%obj.tag

def items(self, obj):

entries =Entry.objects.filter(tags__id__exact=obj.id)

return entries.order_by(’-pub_date’)[:30]

Here’s the basic algorithm of the RSS framework, given this class and a request to the URL

/feeds/tags/python/:

1. The framework gets the URL /feeds/tags/python/ and notices there’s an extra bit of URL after the slug.

It splits that remaining string by the slash character ("/") and calls the Feed class’s get_object() method,

passing it the bits.

In this case, bits is [’python’]. For a request to /feeds/tags/python/django/, bits would be

[’python’, ’django’].

2. get_object() is responsible for retrieving the given Tag object, from the given bits.

In this case, it uses the Django database API to retrieve the Tag. Note that get_object() should

raise django.core.exceptions.ObjectDoesNotExist if given invalid parameters. There’s no

try/except around the Tag.objects.get() call, because it’s not necessary. That function raises

Tag.DoesNotExist on failure, and Tag.DoesNotExist is a subclass of ObjectDoesNotExist.

Raising ObjectDoesNotExist in get_object() tells Django to produce a 404 error for that request.

208 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

3. To generate the feed’s <title>,<link>, and <description>, Django uses the title(),link(), and

description() methods. In the previous example, they were simple string class attributes, but this example

illustrates that they can be either strings or methods. For each of title,link, and description, Django

follows this algorithm:

(a) It tries to call a method, passing the obj argument, where obj is the object returned by get_object().

(b) Failing that, it tries to call a method with no arguments.

4. Finally, note that items() in this example also takes the obj argument. The algorithm for items is the same

as described in the previous step – ﬁrst, it tries items(obj), then items(), and then ﬁnally an items class

attribute (which should be a list).

Full documentation of all the methods and attributes of the Feed classes is always available from the ofﬁcial Django

documentation (http://docs.djangoproject.com/en/dev/ref/contrib/syndication/).

15.5.4 Specifying the Type of Feed

By default, the syndication framework produces RSS 2.0. To change that, add a feed_type attribute to your Feed

class:

from django.utils.feedgenerator import Atom1Feed

class MyFeed(Feed):

feed_type =Atom1Feed

Note that you set feed_type to a class object, not an instance. Currently available feed types are shown in Table

11-1.

Table 15.1: Table 11-1. Feed Types

Feed Class Format

django.utils.feedgenerator.Rss201rev2Feed RSS 2.01 (default)

django.utils.feedgenerator.RssUserland091Feed RSS 0.91

django.utils.feedgenerator.Atom1Feed Atom 1.0

15.5.5 Enclosures

To specify enclosures (i.e., media resources associated with a feed item such as MP3 podcast feeds), use the

item_enclosure_url,item_enclosure_length, and item_enclosure_mime_type hooks, for ex-

ample:

from myproject.models import Song

class MyFeedWithEnclosures(Feed):

title ="Example feed with enclosures"

link ="/feeds/example-with-enclosures/"

def items(self):

return Song.objects.all()[:30]

def item_enclosure_url(self, item):

return item.song_url

def item_enclosure_length(self, item):

15.5. The Syndication Feed Framework 209

The Django Book, Release 2.0

return item.song_length

item_enclosure_mime_type ="audio/mpeg"

This assumes, of course, that you’ve created a Song object with song_url and song_length (i.e., the size in

bytes) ﬁelds.

15.5.6 Language

Feeds created by the syndication framework automatically include the appropriate <language> tag (RSS 2.0) or

xml:lang attribute (Atom). This comes directly from your LANGUAGE_CODE setting.

15.5.7 URLs

The link method/attribute can return either an absolute URL (e.g., "/blog/") or a URL with the fully qualiﬁed

domain and protocol (e.g., "http://www.example.com/blog/"). If link doesn’t return the domain, the

syndication framework will insert the domain of the current site, according to your SITE_ID setting. (See Chapter

16 for more on SITE_ID and the sites framework.)

Atom feeds require a <link rel="self"> that deﬁnes the feed’s current location. The syndication framework

populates this automatically.

15.5.8 Publishing Atom and RSS Feeds in Tandem

Some developers like to make available both Atom and RSS versions of their feeds. That’s easy to do with Django:

just create a subclass of your feed class and set the feed_type to something different. Then update your URLconf

to add the extra versions. Here’s a full example:

from django.contrib.syndication.feeds import Feed

from django.utils.feedgenerator import Atom1Feed

from mysite.blog.models import Entry

class RssLatestEntries(Feed):

title ="My Blog"

link ="/archive/"

description ="The latest news about stuff."

def items(self):

return Entry.objects.order_by(’-pub_date’)[:5]

class AtomLatestEntries(RssLatestEntries):

feed_type =Atom1Feed

And here’s the accompanying URLconf:

from django.conf.urls.defaults import *

from myproject.feeds import RssLatestEntries, AtomLatestEntries

feeds ={

’rss’: RssLatestEntries,

’atom’: AtomLatestEntries,

}

urlpatterns =patterns(’’,

210 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

# ...

(r’^feeds/(?P<url>.*)/$’,’django.contrib.syndication.views.feed’,

{’feed_dict’: feeds}),

# ...

)

15.6 The Sitemap Framework

Asitemap is an XML ﬁle on your Web site that tells search engine indexers how frequently your pages change and

how “important” certain pages are in relation to other pages on your site. This information helps search engines index

your site.

For example, here’s a piece of the sitemap for Django’s Web site (http://www.djangoproject.com/sitemap.xml):

<?xml version="1.0" encoding="UTF-8"?>

<url>

<loc>http://www.djangoproject.com/documentation/</loc>

<changefreq>weekly</changefreq>

</url>

<url>

<loc>http://www.djangoproject.com/documentation/0_90/</loc>

<changefreq>never</changefreq>

</url>

...

</urlset>

For more on sitemaps, see http://www.sitemaps.org/.

The Django sitemap framework automates the creation of this XML ﬁle by letting you express this information in

Python code. To create a sitemap, you just need to write a Sitemap class and point to it in your URLconf.

15.6.1 Installation

To install the sitemap application, follow these steps:

1. Add ’django.contrib.sitemaps’ to your INSTALLED_APPS setting.

2. Make sure ’django.template.loaders.app_directories.load_template_source’ is in

your TEMPLATE_LOADERS setting. It’s in there by default, so you’ll need to change this only if you’ve

changed that setting.

3. Make sure you’ve installed the sites framework (see Chapter 16).

Note: The sitemap application doesn’t install any database tables. The only reason it needs to go into

INSTALLED_APPS is so the load_template_source template loader can ﬁnd the default templates.

15.6.2 Initialization

To activate sitemap generation on your Django site, add this line to your URLconf:

15.6. The Sitemap Framework 211

The Django Book, Release 2.0

(r’^sitemap\.xml$’,’django.contrib.sitemaps.views.sitemap’, {’sitemaps’: sitemaps})

This line tells Django to build a sitemap when a client accesses /sitemap.xml. Note that the dot character in

sitemap.xml is escaped with a backslash, because dots have a special meaning in regular expressions.

The name of the sitemap ﬁle is not important, but the location is. Search engines will only index links in your sitemap

for the current URL level and below. For instance, if sitemap.xml lives in your root directory, it may reference any

URL in your site. However, if your sitemap lives at /content/sitemap.xml, it may only reference URLs that

begin with /content/.

The sitemap view takes an extra, required argument: {’sitemaps’: sitemaps}.sitemaps should be a

dictionary that maps a short section label (e.g., blog or news) to its Sitemap class (e.g., BlogSitemap or

NewsSitemap). It may also map to an instance of a Sitemap class (e.g., BlogSitemap(some_var)).

15.6.3 Sitemap Classes

ASitemap class is a simple Python class that represents a “section” of entries in your sitemap. For example, one

Sitemap class could represent all the entries of your weblog, while another could represent all of the events in your

events calendar.

In the simplest case, all these sections get lumped together into one sitemap.xml, but it’s also possible to use the

framework to generate a sitemap index that references individual sitemap ﬁles, one per section (as described shortly).

Sitemap classes must subclass django.contrib.sitemaps.Sitemap. They can live anywhere in your code

tree.

For example, let’s assume you have a blog system, with an Entry model, and you want your sitemap to include all

the links to your individual blog entries. Here’s how your Sitemap class might look:

from django.contrib.sitemaps import Sitemap

from mysite.blog.models import Entry

class BlogSitemap(Sitemap):

changefreq ="never"

priority =0.5

def items(self):

return Entry.objects.filter(is_draft=False)

def lastmod(self, obj):

return obj.pub_date

Declaring a Sitemap should look very similar to declaring a Feed. That’s by design.

Like Feed classes, Sitemap members can be either methods or attributes. See the steps in the earlier “A Complex

Example” section for more about how this works.

ASitemap class can deﬁne the following methods/attributes:

•items (required): Provides list of objects. The framework doesn’t care what type of objects they are;

all that matters is that these objects get passed to the location(),lastmod(),changefreq(), and

priority() methods.

•location (optional): Gives the absolute URL for a given object. Here, “absolute URL” means a URL that

doesn’t include the protocol or domain. Here are some examples:

–Good: ’/foo/bar/’

–Bad: ’example.com/foo/bar/’

212 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

–Bad: ’http://example.com/foo/bar/’

If location isn’t provided, the framework will call the get_absolute_url() method on each object as

returned by items().

•lastmod (optional): The object’s “last modiﬁcation” date, as a Python datetime object.

•changefreq (optional): How often the object changes. Possible values (as given by the Sitemaps speciﬁca-

tion) are as follows:

–’always’

–’hourly’

–’daily’

–’weekly’

–’monthly’

–’yearly’

–’never’

•priority (optional): A suggested indexing priority between 0.0 and 1.0. The default priority of a page is

0.5; see the http://sitemaps.org/ documentation for more about how priority works.

15.6.4 Shortcuts

The sitemap framework provides a couple convenience classes for common cases. These are described in the sections

that follow.

FlatPageSitemap

The django.contrib.sitemaps.FlatPageSitemap class looks at all ﬂat pages deﬁned for the current

site and creates an entry in the sitemap. These entries include only the location attribute – not lastmod,

changefreq, or priority.

See Chapter 16 for more about ﬂat pages.

GenericSitemap

The GenericSitemap class works with any generic views (see Chapter 11) you already have.

To use it, create an instance, passing in the same info_dict you pass to the generic views. The only requirement

is that the dictionary have a queryset entry. It may also have a date_field entry that speciﬁes a date ﬁeld for

objects retrieved from the queryset. This will be used for the lastmod attribute in the generated sitemap. You

may also pass priority and changefreq keyword arguments to the GenericSitemap constructor to specify

these attributes for all URLs.

Here’s an example of a URLconf using both FlatPageSitemap and GenericSiteMap (with the hypothetical

Entry object from earlier):

from django.conf.urls.defaults import *

from django.contrib.sitemaps import FlatPageSitemap, GenericSitemap

from mysite.blog.models import Entry

info_dict ={

’queryset’: Entry.objects.all(),

15.6. The Sitemap Framework 213

The Django Book, Release 2.0

’date_field’:’pub_date’,

}

sitemaps ={

’flatpages’: FlatPageSitemap,

’blog’: GenericSitemap(info_dict, priority=0.6),

}

urlpatterns =patterns(’’,

# some generic view using info_dict

# ...

# the sitemap

(r’^sitemap\.xml$’,

’django.contrib.sitemaps.views.sitemap’,

{’sitemaps’: sitemaps})

)

15.6.5 Creating a Sitemap Index

The sitemap framework also has the ability to create a sitemap index that references individual sitemap ﬁles, one per

each section deﬁned in your sitemaps dictionary. The only differences in usage are as follows:

• You use two views in your URLconf: django.contrib.sitemaps.views.index and

django.contrib.sitemaps.views.sitemap.

• The django.contrib.sitemaps.views.sitemap view should take a section keyword argument.

Here is what the relevant URLconf lines would look like for the previous example:

(r’^sitemap.xml$’,

’django.contrib.sitemaps.views.index’,

{’sitemaps’: sitemaps}),

(r’^sitemap-(?P<section>.+).xml$’,

’django.contrib.sitemaps.views.sitemap’,

{’sitemaps’: sitemaps})

This will automatically generate a sitemap.xml ﬁle that references both sitemap-flatpages.xml and

sitemap-blog.xml. The Sitemap classes and the sitemaps dictionary don’t change at all.

15.6.6 Pinging Google

You may want to “ping” Google when your sitemap changes, to let it know to reindex your site. The framework

provides a function to do just that: django.contrib.sitemaps.ping_google().

ping_google() takes an optional argument, sitemap_url, which should be the absolute URL of your site’s

sitemap (e.g., ’/sitemap.xml’). If this argument isn’t provided, ping_google() will attempt to ﬁgure out

your sitemap by performing a reverse lookup on your URLconf.

ping_google() raises the exception django.contrib.sitemaps.SitemapNotFound if it cannot deter-

mine your sitemap URL.

One useful way to call ping_google() is from a model’s save() method:

from django.contrib.sitemaps import ping_google

214 Chapter 15. Chapter 13: Generating Non-HTML Content

The Django Book, Release 2.0

class Entry(models.Model):

# ...

def save(self,*args, **kwargs):

super(Entry, self).save(*args, **kwargs)

try:

ping_google()

except Exception:

# Bare ’except’ because we could get a variety

# of HTTP-related exceptions.

pass

A more efﬁcient solution, however, would be to call ping_google() from a cron script or some other scheduled

task. The function makes an HTTP request to Google’s servers, so you may not want to introduce that network

overhead each time you call save().

Finally, if ’django.contrib.sitemaps’ is in your INSTALLED_APPS, then your manage.py will include

a new command, ping_google. This is useful for command-line access to pinging. For example:

python manage.py ping_google /sitemap.xml

15.7 What’s Next?

Next, we’ll continue to dig deeper into the built-in tools Django gives you. Chapter 14 looks at all the tools you need

to provide user-customized sites: sessions, users, and authentication.

15.7. What’s Next? 215

The Django Book, Release 2.0

216 Chapter 15. Chapter 13: Generating Non-HTML Content

CHAPTER 16

Chapter 14: Sessions, Users, and Registration

It’s time for a confession: we’ve been deliberately ignoring an important aspect of Web development prior to this

point. So far, we’ve thought of the trafﬁc visiting our sites as some faceless, anonymous mass hurtling itself against

our carefully designed pages.

This isn’t true, of course. The browsers hitting our sites have real humans behind them (most of the time, at least).

That’s a big thing to ignore: the Internet is at its best when it serves to connect people, not machines. If we’re going to

develop truly compelling sites, eventually we’re going to have to deal with the bodies behind the browsers.

Unfortunately, it’s not all that easy. HTTP is designed to be stateless– that is, each and every request happens in a

vacuum. There’s no persistence between one request and the next, and we can’t count on any aspects of a request (IP

address, user agent, etc.) to consistently indicate successive requests from the same person.

In this chapter you’ll learn how to handle this lack of state. We’ll start at the lowest level (cookies), and work up to the

high-level tools for handling sessions, users and registration.

16.1 Cookies

Browser developers long ago recognized that HTTP’s statelessness poses a huge problem for Web developers, and

thus cookies were born. A cookie is a small piece of information that browsers store on behalf of Web servers. Every

time a browser requests a page from a certain server, it gives back the cookie that it initially received.

Let’s take a look how this might work. When you open your browser and type in google.com, your browser sends

an HTTP request to Google that starts something like this:

GET / HTTP/1.1

Host: google.com

...

When Google replies, the HTTP response looks something like the following:

HTTP/1.1 200 OK

Content-Type: text/html

Set-Cookie: PREF=ID=5b14f22bdaf1e81c:TM=1167000671:LM=1167000671;

expires=Sun, 17-Jan-2038 19:14:07 GMT;

path=/; domain=.google.com

Server: GWS/2.1

...

Notice the Set-Cookie header. Your browser will store that cookie value

(PREF=ID=5b14f22bdaf1e81c:TM=1167000671:LM=1167000671) and serve it back to Google ev-

ery time you access the site. So the next time you access Google, your browser is going to send a request like

this:

217

The Django Book, Release 2.0

GET / HTTP/1.1

Host: google.com

Cookie: PREF=ID=5b14f22bdaf1e81c:TM=1167000671:LM=1167000671

...

Google then can use that Cookie value to know that you’re the same person who accessed the site earlier. This value

might, for example, be a key into a database that stores user information. Google could (and does) use it to display

your account’s username on the page.

16.1.1 Getting and Setting Cookies

When dealing with persistence in Django, most of the time you’ll want to use the higher-level session and/or user

frameworks discussed a little later in this chapter. However, ﬁrst look at how to read and write cookies at a low level.

This should help you understand how the rest of the tools discussed in the chapter actually work, and it will come in

handy if you ever need to play with cookies directly.

Reading cookies that are already set is simple. Every HttpRequest object has a COOKIES object that acts like a

dictionary; you can use it to read any cookies that the browser has sent to the view:

def show_color(request):

if "favorite_color" in request.COOKIES:

return HttpResponse("Your favorite color is %s"%\

request.COOKIES["favorite_color"])

else:

return HttpResponse("You don’t have a favorite color.")

Writing cookies is slightly more complicated. You need to use the set_cookie() method on an HttpResponse

object. Here’s an example that sets the favorite_color cookie based on a GET parameter:

def set_color(request):

if "favorite_color" in request.GET:

# Create an HttpResponse object...

response =HttpResponse("Your favorite color is now %s"%\

request.GET["favorite_color"])

# ... and set a cookie on the response

response.set_cookie("favorite_color",

request.GET["favorite_color"])

return response

else:

return HttpResponse("You didn’t give a favorite color.")

You can also pass a number of optional arguments to response.set_cookie() that control aspects of the cookie,

as shown in Table 14-1.

218 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

Table 16.1: Table 14-1: Cookie options

Pa-

rame-

ter

De-

fault

Description

max_ageNone Age (in seconds) that the cookie should last. If this parameter is None, the cookie will last only

until the browser is closed.

expiresNone The actual date/time when the cookie should expire. It needs to be in the format "Wdy,

DD-Mth-YY HH:MM:SS GMT". If given, this parameter overrides the max_age parameter.

path "/" The path preﬁx that this cookie is valid for. Browsers will only pass the cookie back to pages

below this path preﬁx, so you can use this to prevent cookies from being sent to other sections

of your site.

This is especially useful when you don’t control the top level of your site’s domain.

domain None The domain that this cookie is valid for. You can use this parameter to set a cross-domain

cookie. For example, domain=".example.com" will set a cookie that is readable by the

domains www.example.com,www2.example.com, and

an.other.sub.domain.example.com.

If this parameter is set to None, a cookie will only be readable by the domain that set it.

secure False If set to True, this parameter instructs the browser to only return this cookie to pages accessed

over HTTPS.

16.1.2 The Mixed Blessing of Cookies

You might notice a number of potential problems with the way cookies work. Let’s look at some of the more important

ones:

• Storage of cookies is voluntary; a client does not have to accept or store cookies. In fact, all browsers enable

users to control the policy for accepting cookies. If you want to see just how vital cookies are to the Web, try

turning on your browser’s “prompt to accept every cookie” option.

Despite their nearly universal use, cookies are still the deﬁnition of unreliability. This means that developers

should check that a user actually accepts cookies before relying on them.

• Cookies (especially those not sent over HTTPS) are not secure. Because HTTP data is sent in cleartext, cookies

are extremely vulnerable to snooping attacks. That is, an attacker snooping on the wire can intercept a cookie

and read it. This means you should never store sensitive information in a cookie.

There’s an even more insidious attack, known as a man-in-the-middle attack, wherein an attacker intercepts a

cookie and uses it to pose as another user. Chapter 20 discusses attacks of this nature in depth, as well as ways

to prevent it.

• Cookies aren’t even secure from their intended recipients. Most browsers provide easy ways to

edit the content of individual cookies, and resourceful users can always use tools like mechanize

(http://wwwsearch.sourceforge.net/mechanize/) to construct HTTP requests by hand.

So you can’t store data in cookies that might be sensitive to tampering. The canonical mistake in this scenario

is storing something like IsLoggedIn=1 in a cookie when a user logs in. You’d be amazed at the number of

sites that make mistakes of this nature; it takes only a second to fool these sites’ “security” systems.

16.2 Django’s Session Framework

With all of these limitations and potential security holes, it’s obvious that cookies and persistent sessions are examples

of those “pain points” in Web development. Of course, Django’s goal is to be an effective painkiller, so it comes with

a session framework designed to smooth over these difﬁculties for you.

16.2. Django’s Session Framework 219

The Django Book, Release 2.0

This session framework lets you store and retrieve arbitrary data on a per-site visitor basis. It stores data on the server

side and abstracts the sending and receiving of cookies. Cookies use only a hashed session ID – not the data itself –

thus protecting you from most of the common cookie problems.

Let’s look at how to enable sessions and use them in views.

16.2.1 Enabling Sessions

Sessions are implemented via a piece of middleware (see Chapter 17) and a Django model. To enable sessions, you’ll

need to follow these steps:

1. Edit your MIDDLEWARE_CLASSES setting and make sure MIDDLEWARE_CLASSES contains

’django.contrib.sessions.middleware.SessionMiddleware’.

2. Make sure ’django.contrib.sessions’ is in your INSTALLED_APPS setting (and run manage.py

syncdb if you have to add it).

The default skeleton settings created by startproject have both of these bits already installed, so unless you’ve

removed them, you probably don’t have to change anything to get sessions to work.

If you don’t want to use sessions, you might want to remove the SessionMiddleware line from

MIDDLEWARE_CLASSES and ’django.contrib.sessions’ from your INSTALLED_APPS. It will save you

only a small amount of overhead, but every little bit counts.

16.2.2 Using Sessions in Views

When SessionMiddleware is activated, each HttpRequest object – the ﬁrst argument to any Django view

function – will have a session attribute, which is a dictionary-like object. You can read it and write to it in the same

way you’d use a normal dictionary. For example, in a view you could do stuff like this:

# Set a session value:

request.session["fav_color"]="blue"

# Get a session value -- this could be called in a different view,

# or many requests later (or both):

fav_color =request.session["fav_color"]

# Clear an item from the session:

del request.session["fav_color"]

# Check if the session has a given key:

if "fav_color" in request.session:

...

You can also use other dictionary methods like keys() and items() on request.session.

There are a couple of simple rules for using Django’s sessions effectively:

• Use normal Python strings as dictionary keys on request.session (as opposed to integers, objects, etc.).

• Session dictionary keys that begin with an underscore are reserved for internal use by Django. In practice, the

framework uses only a small number of underscore-preﬁxed session variables, but unless you know what they

all are (and you are willing to keep up with any changes in Django itself), staying away from underscore preﬁxes

will keep Django from interfering with your application.

For example, don’t use a session key called _fav_color, like this:

request.session[’_fav_color’]=’blue’ # Don’t do this!

220 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

• Don’t replace request.session with a new object, and don’t access or set its attributes. Use it like a Python

dictionary. Examples:

request.session =some_other_object # Don’t do this!

request.session.foo =’bar’ # Don’t do this!

Let’s take a look at a few quick examples. This simplistic view sets a has_commented variable to True after a

user posts a comment. It’s a simple (if not particularly secure) way of preventing a user from posting more than one

comment:

def post_comment(request):

if request.method != ’POST’:

raise Http404(’Only POSTs are allowed’)

if ’comment’ not in request.POST:

raise Http404(’Comment not submitted’)

if request.session.get(’has_commented’,False):

return HttpResponse("You’ve already commented.")

c=comments.Comment(comment=request.POST[’comment’])

c.save()

request.session[’has_commented’]=True

return HttpResponse(’Thanks for your comment!’)

This simplistic view logs in a “member” of the site:

def login(request):

if request.method != ’POST’:

raise Http404(’Only POSTs are allowed’)

try:

m=Member.objects.get(username=request.POST[’username’])

if m.password == request.POST[’password’]:

request.session[’member_id’]=m.id

return HttpResponseRedirect(’/you-are-logged-in/’)

except Member.DoesNotExist:

return HttpResponse("Your username and password didn’t match.")

And this one logs out a member who has been logged in via login() above:

def logout(request):

try:

del request.session[’member_id’]

except KeyError:

pass

return HttpResponse("You’re logged out.")

Note: In practice, this is a lousy way of logging users in. The authentication framework discussed shortly handles

this task for you in a much more robust and useful manner. These examples are deliberately simplistic so that you can

easily see what’s going on.

16.2.3 Setting Test Cookies

As mentioned above, you can’t rely on every browser accepting cookies. So, as a convenience, Django provides an

easy way to test whether a user’s browser accepts cookies. Just call request.session.set_test_cookie()

16.2. Django’s Session Framework 221

The Django Book, Release 2.0

in a view, and check request.session.test_cookie_worked() in a subsequent view – not in the same

view call.

This awkward split between set_test_cookie() and test_cookie_worked() is necessary due to the way

cookies work. When you set a cookie, you can’t actually tell whether a browser accepted it until the browser’s next

request.

It’s good practice to use delete_test_cookie() to clean up after yourself. Do this after you’ve veriﬁed that the

test cookie worked.

Here’s a typical usage example:

def login(request):

# If we submitted the form...

if request.method == ’POST’:

# Check that the test cookie worked (we set it below):

if request.session.test_cookie_worked():

# The test cookie worked, so delete it.

request.session.delete_test_cookie()

# In practice, we’d need some logic to check username/password

# here, but since this is an example...

return HttpResponse("You’re logged in.")

# The test cookie failed, so display an error message. If this

# were a real site, we’d want to display a friendlier message.

else:

return HttpResponse("Please enable cookies and try again.")

# If we didn’t post, send the test cookie along with the login form.

request.session.set_test_cookie()

return render(request, ’foo/login_form.html’)

Note: Again, the built-in authentication functions handle this check for you.

16.2.4 Using Sessions Outside of Views

Internally, each session is just a normal Django model deﬁned in django.contrib.sessions.models. Each

session is identiﬁed by a more-or-less random 32-character hash stored in a cookie. Because it’s a normal model, you

can access sessions using the normal Django database API:

>>> from django.contrib.sessions.models import Session

>>> s=Session.objects.get(pk=’2b1189a188b44ad18c35e113ac6ceead’)

>>> s.expire_date

datetime.datetime(2005, 8, 20, 13, 35, 12)

You’ll need to call get_decoded() to get the actual session data. This is necessary because the dictionary is stored

in an encoded format:

>>> s.session_data

’KGRwMQpTJ19hdXRoX3VzZXJfaWQnCnAyCkkxCnMuMTExY2ZjODI2Yj...’

>>> s.get_decoded()

{’user_id’: 42}

222 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

16.2.5 When Sessions Are Saved

By default, Django only saves to the database if the session has been modiﬁed – that is, if any of its dictionary values

have been assigned or deleted:

# Session is modified.

request.session[’foo’]=’bar’

# Session is modified.

del request.session[’foo’]

# Session is modified.

request.session[’foo’]={}

# Gotcha: Session is NOT modified, because this alters

# request.session[’foo’] instead of request.session.

request.session[’foo’][’bar’]=’baz’

To change this default behavior, set SESSION_SAVE_EVERY_REQUEST to True. If

SESSION_SAVE_EVERY_REQUEST is True, Django will save the session to the database on every single

request, even if it wasn’t changed.

Note that the session cookie is sent only when a session has been created or modiﬁed. If

SESSION_SAVE_EVERY_REQUEST is True, the session cookie will be sent on every request. Similarly, the

expires part of a session cookie is updated each time the session cookie is sent.

16.2.6 Browser-Length Sessions vs. Persistent Sessions

You might have noticed that the cookie Google sent us at the beginning of this chapter contained expires=Sun,

17-Jan-2038 19:14:07 GMT;. Cookies can optionally contain an expiration date that advises the browser on

when to remove the cookie. If a cookie doesn’t contain an expiration value, the browser will expire it when the

user closes his or her browser window. You can control the session framework’s behavior in this regard with the

SESSION_EXPIRE_AT_BROWSER_CLOSE setting.

By default, SESSION_EXPIRE_AT_BROWSER_CLOSE is set to False, which means session cookies will be stored

in users’ browsers for SESSION_COOKIE_AGE seconds (which defaults to two weeks, or 1,209,600 seconds). Use

this if you don’t want people to have to log in every time they open a browser.

If SESSION_EXPIRE_AT_BROWSER_CLOSE is set to True, Django will use browser-length cookies.

16.2.7 Other Session Settings

Besides the settings already mentioned, a few other settings inﬂuence how Django’s session framework uses cookies,

as shown in Table 14-2.

Table 16.2: Table 14-2. Settings that inﬂuence cookie behavior

Setting Description Default

SESSION_COOKIE_DOMAINThe domain to use for session cookies. Set this to a string such as

".example.com" for cross-domain cookies, or use None for a standard cookie.

None

SESSION_COOKIE_NAMEThe name of the cookie to use for sessions. This can be any string. "sessionid"

SESSION_COOKIE_SECUREWhether to use a “secure” cookie for the session cookie. If this is set to True, the

cookie will be marked as “secure,” which means that browsers will ensure that the

cookie is only sent via HTTPS.

False

Technical Details

16.2. Django’s Session Framework 223

The Django Book, Release 2.0

For the curious, here are a few technical notes about the inner workings of the session framework:

• The session dictionary accepts any Python object capable of being “pickled.” See the documentation for

Python’s built-in pickle module for information about how this works.

• Session data is stored in a database table named django_session.

• Session data is fetched upon demand. If you never access request.session, Django won’t hit that database

table.

• Django only sends a cookie if it needs to. If you don’t set any session data, it won’t send a session cookie (unless

SESSION_SAVE_EVERY_REQUEST is set to True).

• The Django sessions framework is entirely, and solely, cookie based. It does not fall back to putting session IDs

in URLs as a last resort, as some other tools (PHP, JSP) do.

This is an intentional design decision. Putting sessions in URLs don’t just make URLs ugly, but also make your

site vulnerable to a certain form of session ID theft via the Referer header.

If you’re still curious, the source is pretty straightforward; look in django.contrib.sessions for more details.

16.3 Users and Authentication

Sessions give us a way of persisting data through multiple browser requests; the second part of the equation is using

those sessions for user login. Of course, we can’t just trust that users are who they say they are, so we need to

authenticate them along the way.

Naturally, Django provides tools to handle this common task (and many others). Django’s user authentication system

handles user accounts, groups, permissions, and cookie-based user sessions. This system is often referred to as an

auth/auth (authentication and authorization) system. That name recognizes that dealing with users is often a two-step

process. We need to

1. Verify (authenticate) that a user is who he or she claims to be (usually by checking a username and password

against a database of users)

2. Verify that the user is authorized to perform some given operation (usually by checking against a table of

permissions)

Following these needs, Django’s auth/auth system consists of a number of parts:

•Users: People registered with your site

•Permissions: Binary (yes/no) ﬂags designating whether a user may perform a certain task

•Groups: A generic way of applying labels and permissions to more than one user

•Messages: A simple way to queue and display system messages to users

If you’ve used the admin tool (discussed in Chapter 6), you’ve already seen many of these tools, and if you’ve edited

users or groups in the admin tool, you’ve actually been editing data in the auth system’s database tables.

16.3.1 Enabling Authentication Support

Like the session tools, authentication support is bundled as a Django application in django.contrib that needs to

be installed. Also like the session tools, it’s also installed by default, but if you’ve removed it, you’ll need to follow

these steps to install it:

1. Make sure the session framework is installed as described earlier in this chapter. Keeping track of users obvi-

ously requires cookies, and thus builds on the session framework.

224 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

2. Put ’django.contrib.auth’ in your INSTALLED_APPS setting and run manage.py syncdb to in-

stall the appropriate database tables.

3. Make sure that ’django.contrib.auth.middleware.AuthenticationMiddleware’ is in your

MIDDLEWARE_CLASSES setting – after SessionMiddleware.

With that installation out of the way, we’re ready to deal with users in view functions. The main interface you’ll use

to access users within a view is request.user; this is an object that represents the currently logged-in user. If the

user isn’t logged in, this will instead be an AnonymousUser object (see below for more details).

You can easily tell if a user is logged in with the is_authenticated() method:

if request.user.is_authenticated():

# Do something for authenticated users.

else:

# Do something for anonymous users.

16.3.2 Using Users

Once you have a User – often from request.user, but possibly through one of the other methods discussed

shortly – you have a number of ﬁelds and methods available on that object. AnonymousUser objects emulate some

of this interface, but not all of it, so you should always check user.is_authenticated() before assuming

you’re dealing with a bona ﬁde user object. Tables 14-3 and 14-4 list the ﬁelds and methods, respectively, on User

objects.

Table 16.3: Table 14-3. Fields on User Objects

Field Description

username Required; 30 characters or fewer. Alphanumeric characters only (letters, digits, and underscores).

first_nameOptional; 30 characters or fewer.

last_name Optional; 30 characters or fewer.

email Optional. E-mail address.

password Required. A hash of, and metadata about, the password (Django doesn’t store the raw password).

See the “Passwords” section for more about this value.

is_staff Boolean. Designates whether this user can access the admin site.

is_active Boolean. Designates whether this account can be used to log in. Set this ﬂag to False instead of

deleting accounts.

is_superuserBoolean. Designates that this user has all permissions without explicitly assigning them.

last_loginA datetime of the user’s last login. This is set to the current date/time by default.

date_joinedA datetime designating when the account was created. This is set to the current date/time by default

when the account is created.

16.3. Users and Authentication 225

The Django Book, Release 2.0

Table 16.4: Table 14-4. Methods on User Objects

Method Description

is_authenticated()Always returns True for “real” User objects. This is a way to tell if the user has been

authenticated. This does not imply any permissions, and it doesn’t check if the user is

active. It only indicates that the user has sucessfully authenticated.

is_anonymous() Returns True only for AnonymousUser objects (and False for “real” User

objects). Generally, you should prefer using is_authenticated() to this method.

get_full_name() Returns the first_name plus the last_name, with a space in between.

set_password(passwd)Sets the user’s password to the given raw string, taking care of the password hashing.

This doesn’t actually save the User object.

check_password(passwd)Returns True if the given raw string is the correct password for the user. This takes care

of the password hashing in making the comparison.

get_group_permissions()Returns a list of permission strings that the user has through the groups he or she belongs

to.

get_all_permissions()Returns a list of permission strings that the user has, both through group and user

permissions.

has_perm(perm) Returns True if the user has the speciﬁed permission, where perm is in the format

"package.codename". If the user is inactive, this method will always return False.

has_perms(perm_list)Returns True if the user has all of the speciﬁed permissions. If the user is inactive, this

method will always return False.

has_module_perms(app_label)Returns True if the user has any permissions in the given app_label. If the user is

inactive, this method will always return False.

get_and_delete_messages()Returns a list of Message objects in the user’s queue and deletes the messages from the

queue.

email_user(subj,

msg)

Sends an email to the user. This email is sent from the DEFAULT_FROM_EMAIL setting.

You can also pass a third argument, from_email, to override the From address on the

email.

Finally, User objects have two many-to-many ﬁelds: groups and permissions.User objects can access their

related objects in the same way as any other many-to-many ﬁeld:

# Set a user’s groups:

myuser.groups =group_list

# Add a user to some groups:

myuser.groups.add(group1, group2,...)

# Remove a user from some groups:

myuser.groups.remove(group1, group2,...)

# Remove a user from all groups:

myuser.groups.clear()

# Permissions work the same way

myuser.permissions =permission_list

myuser.permissions.add(permission1, permission2, ...)

myuser.permissions.remove(permission1, permission2, ...)

myuser.permissions.clear()

16.3.3 Logging In and Out

Django provides built-in view functions for handling logging in and out (and a few other nifty tricks), but before we

get to those, let’s take a look at how to log users in and out “by hand.” Django provides two functions to perform these

actions in django.contrib.auth:authenticate() and login().

226 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

To authenticate a given username and password, use authenticate(). It takes two keyword arguments,

username and password, and it returns a User object if the password is valid for the given username. If the

password is invalid, authenticate() returns None:

>>> from django.contrib import auth

>>> user =auth.authenticate(username=’john’, password=’secret’)

>>> if user is not None:

... print "Correct!"

... else:

... print "Invalid password."

authenticate() only veriﬁes a user’s credentials. To log in a user, use login(). It takes an HttpRequest

object and a User object and saves the user’s ID in the session, using Django’s session framework.

This example shows how you might use both authenticate() and login() within a view function:

from django.contrib import auth

def login_view(request):

username =request.POST.get(’username’,’’)

password =request.POST.get(’password’,’’)

user =auth.authenticate(username=username, password=password)

if user is not None and user.is_active:

# Correct password, and the user is marked "active"

auth.login(request, user)

# Redirect to a success page.

return HttpResponseRedirect("/account/loggedin/")

else:

# Show an error page

return HttpResponseRedirect("/account/invalid/")

To log out a user, use django.contrib.auth.logout() within your view. It takes an HttpRequest object

and has no return value:

from django.contrib import auth

def logout_view(request):

auth.logout(request)

# Redirect to a success page.

return HttpResponseRedirect("/account/loggedout/")

Note that auth.logout() doesn’t throw any errors if the user wasn’t logged in.

In practice, you usually will not need to write your own login/logout functions; the authentication system comes with

a set of views for generically handling logging in and out. The ﬁrst step in using these authentication views is to wire

them up in your URLconf. You’ll need to add this snippet:

from django.contrib.auth.views import login, logout

urlpatterns =patterns(’’,

# existing patterns here...

(r’^accounts/login/$’, login),

(r’^accounts/logout/$’, logout),

)

/accounts/login/ and /accounts/logout/ are the default URLs that Django uses for these views.

By default, the login view renders a template at registration/login.html (you can change this tem-

plate name by passing an extra view argument ,‘‘template_name‘‘). This form needs to contain a username and

apassword ﬁeld. A simple template might look like this:

16.3. Users and Authentication 227

The Django Book, Release 2.0

{% extends "base.html" %}

{% block content %}

{% if form.errors %}

<p class="error">Sorry, that’s not a valid username or password</p>

{% endif %}

<label for="password">Password:</label>

</form>

{% endblock %}

If the user successfully logs in, he or she will be redirected to /accounts/profile/ by default. You can override

this by providing a hidden ﬁeld called next with the URL to redirect to after logging in. You can also pass this value

as a GET parameter to the login view and it will be automatically added to the context as a variable called next that

you can insert into that hidden ﬁeld.

The logout view works a little differently. By default it renders a template at registration/logged_out.html

(which usually contains a “You’ve successfully logged out” message). However, you can call the view with an extra

argument, next_page, which will instruct the view to redirect after a logout.

16.3.4 Limiting Access to Logged-in Users

Of course, the reason we’re going through all this trouble is so we can limit access to parts of our site.

The simple, raw way to limit access to pages is to check request.user.is_authenticated() and redirect to

a login page:

from django.http import HttpResponseRedirect

def my_view(request):

if not request.user.is_authenticated():

return HttpResponseRedirect(’/accounts/login/?next=%s’%request.path)

# ...

or perhaps display an error message:

def my_view(request):

if not request.user.is_authenticated():

return render(request, ’myapp/login_error.html’)

# ...

As a shortcut, you can use the convenient login_required decorator:

from django.contrib.auth.decorators import login_required

@login_required

def my_view(request):

# ...

login_required does the following:

228 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

• If the user isn’t logged in, redirect to /accounts/login/, passing the current URL path in the query string

as next, for example: /accounts/login/?next=/polls/3/.

• If the user is logged in, execute the view normally. The view code can then assume that the user is logged in.

16.3.5 Limiting Access to Users Who Pass a Test

Limiting access based on certain permissions or some other test, or providing a different location for the login view

works essentially the same way.

The raw way is to run your test on request.user in the view directly. For example, this view checks to make sure

the user is logged in and has the permission polls.can_vote (more about how permissions works follows):

def vote(request):

if request.user.is_authenticated() and request.user.has_perm(’polls.can_vote’)):

# vote here

else:

return HttpResponse("You can’t vote in this poll.")

Again, Django provides a shortcut called user_passes_test. It takes arguments and generates a specialized

decorator for your particular situation:

def user_can_vote(user):

return user.is_authenticated() and user.has_perm("polls.can_vote")

@user_passes_test(user_can_vote, login_url="/login/")

def vote(request):

# Code here can assume a logged-in user with the correct permission.

...

user_passes_test takes one required argument: a callable that takes a User object and returns True if the

user is allowed to view the page. Note that user_passes_test does not automatically check that the User is

authenticated; you should do that yourself.

In this example we’re also showing the second (optional) argument, login_url, which lets you specify the URL for

your login page (/accounts/login/ by default). If the user doesn’t pass the test, then the user_passes_test

decorator will redirect the user to the login_url.

Because it’s a relatively common task to check whether a user has a particular permission, Django provides a shortcut

for that case: the permission_required() decorator. Using this decorator, the earlier example can be written

as follows:

from django.contrib.auth.decorators import permission_required

@permission_required(’polls.can_vote’, login_url="/login/")

def vote(request):

# ...

Note that permission_required() also takes an optional login_url parameter, which also defaults to

’/accounts/login/’.

Limiting Access to Generic Views

One of the most frequently asked questions on the Django users list deals with limiting access to a generic view. To

pull this off, you’ll need to write a thin wrapper around the view and point your URLconf to your wrapper instead of

the generic view itself:

from django.contrib.auth.decorators import login_required

from django.views.generic.date_based import object_detail

16.3. Users and Authentication 229

The Django Book, Release 2.0

@login_required

def limited_object_detail(*args, **kwargs):

return object_detail(*args, **kwargs)

You can, of course, replace login_required with any of the other limiting decorators.

16.3.6 Managing Users, Permissions, and Groups

The easiest way by far to manage the auth system is through the admin interface. Chapter 6 discusses how to use

Django’s admin site to edit users and control their permissions and access, and most of the time you’ll just use that

interface.

However, there are low-level APIs you can dive into when you need absolute control, and we discuss these in the

sections that follow.

Creating Users

Create users with the create_user helper function:

>>> from django.contrib.auth.models import User

>>> user =User.objects.create_user(username=’john’,

... email=’jlennon@beatles.com’,

... password=’glass onion’)

At this point, user is a User instance ready to be saved to the database (create_user() doesn’t actually call

save() itself). You can continue to change its attributes before saving, too:

>>> user.is_staff =True

>>> user.save()

Changing Passwords

You can change a password with set_password():

>>> user =User.objects.get(username=’john’)

>>> user.set_password(’goo goo goo joob’)

>>> user.save()

Don’t set the password attribute directly unless you know what you’re doing. The password is actually stored as a

salted hash and thus can’t be edited directly.

More formally, the password attribute of a User object is a string in this format:

hashtype$salt$hash

That’s a hash type, the salt, and the hash itself, separated by the dollar sign ($) character.

hashtype is either sha1 (default) or md5, the algorithm used to perform a one-way hash of the password. salt is

a random string used to salt the raw password to create the hash, for example:

sha1$a1976$a36cc8cbf81742a8fb52e221aaeab48ed7f58ab4

The User.set_password() and User.check_password() functions handle the setting and checking of

these values behind the scenes.

230 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

Salted hashes

Ahash is a one-way cryptographic function – that is, you can easily compute the hash of a given value, but it’s nearly

impossible to take a hash and reconstruct the original value.

If we stored passwords as plain text, anyone who got their hands on the password database would instantly know

everyone’s password. Storing passwords as hashes reduces the value of a compromised database.

However, an attacker with the password database could still run a brute- force attack, hashing millions of passwords

and comparing those hashes against the stored values. This takes some time, but less than you might think.

Worse, there are publicly available rainbow tables, or databases of pre-computed hashes of millions of passwords.

With a rainbow table, an experienced attacker could break most passwords in seconds.

Adding a salt – basically an initial random value – to the stored hash adds another layer of difﬁculty to breaking

passwords. Because salts differ from password to password, they also prevent the use of a rainbow table, thus forcing

attackers to fall back on a brute-force attack, itself made more difﬁcult by the extra entropy added to the hash by the

salt.

While salted hashes aren’t absolutely the most secure way of storing passwords, they’re a good middle ground between

security and convenience.

Handling Registration

We can use these low-level tools to create views that allow users to sign up for new accounts. Different developers

implement registration differently, so Django leaves writing a registration view up to you. Luckily, it’s pretty easy.

At its simplest, we could provide a small view that prompts for the required user information and creates those users.

Django provides a built-in form you can use for this purpose, which we’ll use in this example:

from django import forms

from django.contrib.auth.forms import UserCreationForm

from django.http import HttpResponseRedirect

from django.shortcuts import render

def register(request):

if request.method == ’POST’:

form =UserCreationForm(request.POST)

if form.is_valid():

new_user =form.save()

return HttpResponseRedirect("/books/")

else:

form =UserCreationForm()

return render(request, "registration/register.html", {

’form’: form,

})

This form assumes a template named registration/register.html. Here’s an example of what that template

might look like:

{% extends "base.html" %}

{% block title %}Create an account{% endblock %}

{% block content %}

<h1>Create an account</h1>

16.3. Users and Authentication 231

The Django Book, Release 2.0

</form>

{% endblock %}

16.3.7 Using Authentication Data in Templates

The current logged-in user and his or her permissions are made available in the template context when you use the

render() shortcut or explicitly use a RequestContext (see Chapter 9).

Note: Technically, these variables are only made available in the template context if

you use RequestContext and your TEMPLATE_CONTEXT_PROCESSORS setting contains

"django.core.context_processors.auth", which is the default. Again, see Chapter 9 for more

information.

When using RequestContext, the current user (either a User instance or an AnonymousUser instance) is stored

in the template variable {{ user }}:

{% if user.is_authenticated %}

<p>Welcome, {{ user.username }}. Thanks for logging in.</p>

{% else %}

<p>Welcome, new user. Please log in.</p>

{% endif %}

This user’s permissions are stored in the template variable {{ perms }}. This is a template-friendly proxy to a

couple of permission methods described shortly.

There are two ways you can use this perms object. You can use something like {% if perms.polls %}

to check if the user has any permissions for some given application, or you can use something like {% if

perms.polls.can_vote %} to check if the user has a speciﬁc permission.

Thus, you can check permissions in template {% if %} statements:

{% if perms.polls %}

<p>You have permission to do something in the polls app.</p>

{% if perms.polls.can_vote %}

{% endif %}

{% else %}

<p>You don’t have permission to do anything in the polls app.</p>

{% endif %}

16.4 Permissions, Groups and Messages

There are a few other bits of the authentication framework that we’ve only dealt with in passing. We’ll take a closer

look at them in the following sections.

16.4.1 Permissions

Permissions are a simple way to “mark” users and groups as being able to perform some action. They are usually used

by the Django admin site, but you can easily use them in your own code.

The Django admin site uses permissions as follows:

232 Chapter 16. Chapter 14: Sessions, Users, and Registration

The Django Book, Release 2.0

• Access to view the “add” form, and add an object is limited to users with the add permission for that type of

object.

• Access to view the change list, view the “change” form, and change an object is limited to users with the change

permission for that type of object.

• Access to delete an object is limited to users with the delete permission for that type of object.

Permissions are set globally per type of object, not per speciﬁc object instance. For example, it’s possible to say “Mary

may change news stories,” but permissions don’t let you say “Mary may change news stories, but only the ones she

created herself” or “Mary may only change news stories that have a certain status, publication date, or ID.”

These three basic permissions – add, change, and delete – are automatically created for each Django model. Be-

hind the scenes, these permissions are added to the auth_permission database table when you run manage.py

syncdb.

These permissions will be of the form "<app>.<action>_<object_name>". That is, if you have

apolls application with a Choice model, you’ll get permissions named "polls.add_choice",

"polls.change_choice", and "polls.delete_choice".

Just like users, permissions are implemented in a Django model that lives in django.contrib.auth.models.

This means that you can use Django’s database API to interact directly with permissions if you like.

16.4.2 Groups

Groups are a generic way of categorizing users so you can apply permissions, or some other label, to those users. A

user can belong to any number of groups.

A user in a group automatically has the permissions granted to that group. For example, if the group Site editors

has the permission can_edit_home_page, any user in that group will have that permission.

Groups are also a convenient way to categorize users to give them some label, or extended functionality. For example,

you could create a group ’Special users’, and you could write code that could, say, give those users access to a

members-only portion of your site, or send them members-only e-mail messages.

Like users, the easiest way to manage groups is through the admin interface. However, groups are also just Django

models that live in django.contrib.auth.models, so once again you can always use Django’s database APIs

to deal with groups at a low level.

16.4.3 Messages

The message system is a lightweight way to queue messages for given users. A message is associated with a User.

There’s no concept of expiration or timestamps.

Messages are used by the Django admin interface after successful actions. For example, when you create an object,

you’ll notice a “The object was created successfully” message at the top of the admin page.

You can use the same API to queue and display messages in your own application. The API is simple:

• To create a new message, use user.message_set.create(message=’message_text’).

• To retrieve/delete messages, use user.get_and_delete_messages(), which returns a list of Message

objects in the user’s queue (if any) and deletes the messages from the queue.

In this example view, the system saves a message for the user after creating a playlist:

def create_playlist(request, songs):

# Create the playlist with the given songs.

# ...

request.user.message_set.create(

16.4. Permissions, Groups and Messages 233

The Django Book, Release 2.0

message="Your playlist was added successfully."

)

return render(request, "playlists/create.html")

When you use the render() shortcut or render a template with a RequestContext, the current logged-in user

and his or her messages are made available in the template context as the template variable {{ messages }}.

Here’s an example of template code that displays messages:

{% if messages %}

<ul>

{% for message in messages %}

<li>{{ message }}</li>

{% endfor %}

</ul>

{% endif %}

Note that RequestContext calls get_and_delete_messages behind the scenes, so any messages will be

deleted even if you don’t display them.

Finally, note that this messages framework only works with users in the user database. To send messages to anonymous

users, use the session framework directly.

16.5 What’s Next

The session and authorization system is a lot to absorb. Most of the time, you won’t need all the features described in

this chapter, but when you need to allow complex interactions between users, it’s good to have all that power available.

In the next chapter, we’ll take a look at Django’s caching infrastructure, which is a convenient way to improve the

performance of your application.

234 Chapter 16. Chapter 14: Sessions, Users, and Registration

CHAPTER 17

Chapter 15: Caching

A fundamental trade-off in dynamic Web sites is, well, they’re dynamic. Each time a user requests a page, the Web

server makes all sorts of calculations – from database queries to template rendering to business logic – to create the

page that your site’s visitor sees. This is a lot more expensive, from a processing-overhead perspective, than your

standard read-a-ﬁle-off-the-ﬁlesystem server arrangement.

For most Web applications, this overhead isn’t a big deal. Most Web applications aren’t washingtonpost.com or

slashdot.org; they’re simply small- to medium-sized sites with so-so trafﬁc. But for medium- to high-trafﬁc sites, it’s

essential to cut as much overhead as possible.

That’s where caching comes in.

To cache something is to save the result of an expensive calculation so that you don’t have to perform the calculation

next time. Here’s some pseudocode explaining how this would work for a dynamically generated Web page:

given a URL, try finding that page in the cache

if the page is in the cache:

return the cached page

else:

generate the page

save the generated page in the cache (for next time)

return the generated page

Django comes with a robust cache system that lets you save dynamic pages so they don’t have to be calculated for

each request. For convenience, Django offers different levels of cache granularity: You can cache the output of speciﬁc

views, you can cache only the pieces that are difﬁcult to produce, or you can cache your entire site.

Django also works well with “upstream” caches, such as Squid (http://www.squid-cache.org/) and browser-based

caches. These are the types of caches that you don’t directly control but to which you can provide hints (via HTTP

headers) about which parts of your site should be cached, and how.

17.1 Setting Up the Cache

The cache system requires a small amount of setup. Namely, you have to tell it where your cached data should live –

whether in a database, on the ﬁlesystem or directly in memory. This is an important decision that affects your cache’s

performance; yes, some cache types are faster than others.

Your cache preference goes in the CACHE_BACKEND setting in your settings ﬁle. Here’s an explanation of all available

values for CACHE_BACKEND.

235

The Django Book, Release 2.0

17.1.1 Memcached

By far the fastest, most efﬁcient type of cache available to Django, Memcached is an entirely memory-based cache

framework originally developed to handle high loads at LiveJournal.com and subsequently open-sourced by Danga

Interactive. It’s used by sites such as Facebook and Wikipedia to reduce database access and dramatically increase site

performance.

Memcached is available for free at http://danga.com/memcached/ . It runs as a daemon and is allotted a speciﬁed

amount of RAM. All it does is provide an fast interface for adding, retrieving and deleting arbitrary data in the cache.

All data is stored directly in memory, so there’s no overhead of database or ﬁlesystem usage.

After installing Memcached itself, you’ll need to install the Memcached Python bindings, which are not bundled with

Django directly. Two versions of this are available. Choose and install one of the following modules:

• The fastest available option is a module called cmemcache, available at http://gijsbert.org/cmemcache/ .

• If you can’t install cmemcache, you can install python-memcached, available at

ftp://ftp.tummy.com/pub/python-memcached/ . If that URL is no longer valid, just go to the Memcached

Web site (http://www.danga.com/memcached/) and get the Python bindings from the “Client APIs” section.

To use Memcached with Django, set CACHE_BACKEND to memcached://ip:port/, where ip is the IP address

of the Memcached daemon and port is the port on which Memcached is running.

In this example, Memcached is running on localhost (127.0.0.1) port 11211:

CACHE_BACKEND =’memcached://127.0.0.1:11211/’

One excellent feature of Memcached is its ability to share cache over multiple servers. This means you can run

Memcached daemons on multiple machines, and the program will treat the group of machines as a single cache,

without the need to duplicate cache values on each machine. To take advantage of this feature, include all server

addresses in CACHE_BACKEND, separated by semicolons.

In this example, the cache is shared over Memcached instances running on IP address 172.19.26.240 and

172.19.26.242, both on port 11211:

CACHE_BACKEND =’memcached://172.19.26.240:11211;172.19.26.242:11211/’

In the following example, the cache is shared over Memcached instances running on the IP addresses 172.19.26.240

(port 11211), 172.19.26.242 (port 11212), and 172.19.26.244 (port 11213):

CACHE_BACKEND =’memcached://172.19.26.240:11211;172.19.26.242:11212;172.19.26.244:11213/’

A ﬁnal point about Memcached is that memory-based caching has one disadvantage: Because the cached data is stored

in memory, the data will be lost if your server crashes. Clearly, memory isn’t intended for permanent data storage, so

don’t rely on memory-based caching as your only data storage. Without a doubt, none of the Django caching backends

should be used for permanent storage – they’re all intended to be solutions for caching, not storage – but we point this

out here because memory-based caching is particularly temporary.

17.1.2 Database Caching

To use a database table as your cache backend, ﬁrst create a cache table in your database by running this command:

python manage.py createcachetable [cache_table_name]

...where [cache_table_name] is the name of the database table to create. (This name can be whatever you want,

as long as it’s a valid table name that’s not already being used in your database.) This command creates a single table

in your database that is in the proper format that Django’s database-cache system expects.

Once you’ve created that database table, set your CACHE_BACKEND setting to "db://tablename", where

tablename is the name of the database table. In this example, the cache table’s name is my_cache_table:

236 Chapter 17. Chapter 15: Caching

The Django Book, Release 2.0

CACHE_BACKEND =’db://my_cache_table’

The database caching backend uses the same database as speciﬁed in your settings ﬁle. You can’t use a different

database backend for your cache table.

Database caching works best if you’ve got a fast, well-indexed database server.

17.1.3 Filesystem Caching

To store cached items on a ﬁlesystem, use the "file://" cache type for CACHE_BACKEND. For example, to store

cached data in /var/tmp/django_cache, use this setting:

CACHE_BACKEND =’file:///var/tmp/django_cache’

Note that there are three forward slashes toward the beginning of that example. The ﬁrst two are for file://, and

the third is the ﬁrst character of the directory path, /var/tmp/django_cache. If you’re on Windows, put the

drive letter after the file://, like this:

file://c:/foo/bar

The directory path should be absolute – that is, it should start at the root of your ﬁlesystem. It doesn’t matter whether

you put a slash at the end of the setting.

Make sure the directory pointed-to by this setting exists and is readable and writable by the system user under which

your Web server runs. Continuing the above example, if your server runs as the user apache, make sure the directory

/var/tmp/django_cache exists and is readable and writable by the user apache.

Each cache value will be stored as a separate ﬁle whose contents are the cache data saved in a serialized (“pickled”)

format, using Python’s pickle module. Each ﬁle’s name is the cache key, escaped for safe ﬁlesystem use.

17.1.4 Local-Memory Caching

If you want the speed advantages of in-memory caching but don’t have the capability of running Memcached, consider

the local-memory cache backend. This cache is multi-process and thread-safe. To use it, set CACHE_BACKEND to

"locmem:///". For example:

CACHE_BACKEND =’locmem:///’

Note that each process will have its own private cache instance, which means no cross-process caching is possible.

This obviously also means the local memory cache isn’t particularly memory-efﬁcient, so it’s probably not a good

choice for production environments. It’s nice for development.

17.1.5 Dummy Caching (For Development)

Finally, Django comes with a “dummy” cache that doesn’t actually cache – it just implements the cache interface

without doing anything.

This is useful if you have a production site that uses heavy-duty caching in various places but a development/test

environment where you don’t want to cache and don’t want to have to change your code to special-case the latter. To

activate dummy caching, set CACHE_BACKEND like so:

CACHE_BACKEND =’dummy:///’

17.1. Setting Up the Cache 237

The Django Book, Release 2.0

17.1.6 Using a Custom Cache Backend

While Django includes support for a number of cache backends out-of-the-box, sometimes you might want to use a

customized cache backend. To use an external cache backend with Django, use a Python import path as the scheme

portion (the part before the initial colon) of the CACHE_BACKEND URI, like so:

CACHE_BACKEND =’path.to.backend://’

If you’re building your own backend, you can use the standard cache backends as reference implementations. You’ll

ﬁnd the code in the django/core/cache/backends/ directory of the Django source.

Note: Without a really compelling reason, such as a host that doesn’t support them, you should stick to the cache

backends included with Django. They’ve been well-tested and are easy to use.

17.1.7 CACHE_BACKEND Arguments

Each cache backend may take arguments. They’re given in query-string style on the CACHE_BACKEND setting. Valid

arguments are as follows:

•timeout: The default timeout, in seconds, to use for the cache. This argument defaults to 300 seconds (5

minutes).

•max_entries: For the locmem,filesystem and database backends, the maximum number of entries

allowed in the cache before old values are deleted. This argument defaults to 300.

•cull_percentage: The percentage of entries that are culled when max_entries is reached. The

actual ratio is 1/cull_percentage, so set cull_percentage=2 to cull half of the entries when

max_entries is reached.

A value of 0for cull_percentage means that the entire cache will be dumped when max_entries is

reached. This makes culling much faster at the expense of more cache misses.

In this example, timeout is set to 60:

CACHE_BACKEND ="memcached://127.0.0.1:11211/?timeout=60"

In this example, timeout is 30 and max_entries is 400:

CACHE_BACKEND ="locmem:///?timeout=30&max_entries=400"

Invalid arguments are silently ignored, as are invalid values of known arguments.

17.2 The Per-Site Cache

Once the cache is set up, the simplest way to use caching is to cache your entire site.

You’ll need to add ’django.middleware.cache.UpdateCacheMiddleware’ and

’django.middleware.cache.FetchFromCacheMiddleware’ to your MIDDLEWARE_CLASSES

setting, as in this example:

MIDDLEWARE_CLASSES =(

’django.middleware.cache.UpdateCacheMiddleware’,

’django.middleware.common.CommonMiddleware’,

’django.middleware.cache.FetchFromCacheMiddleware’,

)

Note: No, that’s not a typo: the “update” middleware must be ﬁrst in the list, and the “fetch” middleware must be

last. The details are a bit obscure, but see Order of MIDDLEWARE_CLASSES below if you’d like the full story.

238 Chapter 17. Chapter 15: Caching

The Django Book, Release 2.0

Then, add the following required settings to your Django settings ﬁle:

•CACHE_MIDDLEWARE_SECONDS – The number of seconds each page should be cached.

•CACHE_MIDDLEWARE_KEY_PREFIX – If the cache is shared across multiple sites using the same Django

installation, set this to the name of the site, or some other string that is unique to this Django instance, to prevent

key collisions. Use an empty string if you don’t care.

The cache middleware caches every page that doesn’t have GET or POST parameters. Optionally, if the

CACHE_MIDDLEWARE_ANONYMOUS_ONLY setting is True, only anonymous requests (i.e., not those made by

a logged-in user) will be cached. This is a simple and effective way of disabling caching for any user-speciﬁc pages

(include Django’s admin interface). Note that if you use CACHE_MIDDLEWARE_ANONYMOUS_ONLY, you should

make sure you’ve activated AuthenticationMiddleware.

Additionally, the cache middleware automatically sets a few headers in each HttpResponse:

• Sets the Last-Modified header to the current date/time when a fresh (uncached) version of the page is

requested.

• Sets the Expires header to the current date/time plus the deﬁned CACHE_MIDDLEWARE_SECONDS.

• Sets the Cache-Control header to give a max age for the page – again, from the

CACHE_MIDDLEWARE_SECONDS setting.

See Chapter 17 for more on middleware.

If a view sets its own cache expiry time (i.e. it has a max-age section in its Cache-Control header) then the

page will be cached until the expiry time, rather than CACHE_MIDDLEWARE_SECONDS. Using the decorators in

django.views.decorators.cache you can easily set a view’s expiry time (using the cache_control

decorator) or disable caching for a view (using the never_cache decorator). See the “Using other headers” section

below for more on these decorators.

17.3 The Per-View Cache

A more granular way to use the caching framework is by caching the output of individual views.

django.views.decorators.cache deﬁnes a cache_page decorator that will automatically cache the view’s

response for you. It’s easy to use:

from django.views.decorators.cache import cache_page

def my_view(request):

# ...

my_view = cache_page(my_view, 60 *15)

Or, using Python 2.4’s decorator syntax:

@cache_page(60 *15)

def my_view(request):

# ...

cache_page takes a single argument: the cache timeout, in seconds. In the above example, the result of the

my_view() view will be cached for 15 minutes. (Note that we’ve written it as 60 *15 for the purpose of read-

ability. 60 *15 will be evaluated to 900 – that is, 15 minutes multiplied by 60 seconds per minute.)

The per-view cache, like the per-site cache, is keyed off of the URL. If multiple URLs point at the same view, each

URL will be cached separately. Continuing the my_view example, if your URLconf looks like this:

17.3. The Per-View Cache 239

The Django Book, Release 2.0

urlpatterns =(’’,

(r’^foo/(\d{1,2})/$’, my_view),

)

then requests to /foo/1/ and /foo/23/ will be cached separately, as you may expect. But once a particular URL

(e.g., /foo/23/) has been requested, subsequent requests to that URL will use the cache.

17.3.1 Specifying Per-View Cache in the URLconf

The examples in the previous section have hard-coded the fact that the view is cached, because cache_page alters

the my_view function in place. This approach couples your view to the cache system, which is not ideal for several

reasons. For instance, you might want to reuse the view functions on another, cache-less site, or you might want to

distribute the views to people who might want to use them without being cached. The solution to these problems is to

specify the per-view cache in the URLconf rather than next to the view functions themselves.

Doing so is easy: simply wrap the view function with cache_page when you refer to it in the URLconf. Here’s the

old URLconf from earlier:

urlpatterns =(’’,

(r’^foo/(\d{1,2})/$’, my_view),

)

Here’s the same thing, with my_view wrapped in cache_page:

from django.views.decorators.cache import cache_page

urlpatterns =(’’,

(r’^foo/(\d{1,2})/$’, cache_page(my_view, 60 *15)),

)

If you take this approach, don’t forget to import cache_page within your URLconf.

17.4 Template Fragment Caching

If you’re after even more control, you can also cache template fragments using the cache template tag. To give your

template access to this tag, put {% load cache %} near the top of your template.

The {% cache %} template tag caches the contents of the block for a given amount of time. It takes at least two

arguments: the cache timeout, in seconds, and the name to give the cache fragment. For example:

{% load cache %}

{% cache 500 sidebar %}

.. sidebar ..

{% endcache %}

Sometimes you might want to cache multiple copies of a fragment depending on some dynamic data that appears

inside the fragment. For example, you might want a separate cached copy of the sidebar used in the previous example

for every user of your site. Do this by passing additional arguments to the {% cache %} template tag to uniquely

identify the cache fragment:

{% load cache %}

{% cache 500 sidebar request.user.username %}

.. sidebar for logged in user ..

{% endcache %}

240 Chapter 17. Chapter 15: Caching

The Django Book, Release 2.0

It’s perfectly ﬁne to specify more than one argument to identify the fragment. Simply pass as many arguments to {%

cache %} as you need.

The cache timeout can be a template variable, as long as the template variable resolves to an integer value. For example,

if the template variable my_timeout is set to the value 600, then the following two examples are equivalent:

{% cache 600 sidebar %} ... {% endcache %}

{% cache my_timeout sidebar %} ... {% endcache %}

This feature is useful in avoiding repetition in templates. You can set the timeout in a variable, in one place, and just

reuse that value.

17.5 The Low-Level Cache API

Sometimes, caching an entire rendered page doesn’t gain you very much and is, in fact, inconvenient overkill.

Perhaps, for instance, your site includes a view whose results depend on several expensive queries, the results of which

change at different intervals. In this case, it would not be ideal to use the full-page caching that the per-site or per-view

cache strategies offer, because you wouldn’t want to cache the entire result (since some of the data changes often), but

you’d still want to cache the results that rarely change.

For cases like this, Django exposes a simple, low-level cache API. You can use this API to store objects in the cache

with any level of granularity you like. You can cache any Python object that can be pickled safely: strings, dictionaries,

lists of model objects, and so forth. (Most common Python objects can be pickled; refer to the Python documentation

for more information about pickling.)

The cache module, django.core.cache, has a cache object that’s automatically created from the

CACHE_BACKEND setting:

>>> from django.core.cache import cache

The basic interface is set(key, value, timeout_seconds) and get(key):

>>> cache.set(’my_key’,’hello, world!’,30)

>>> cache.get(’my_key’)

’hello, world!’

The timeout_seconds argument is optional and defaults to the timeout argument in the CACHE_BACKEND

setting (explained above).

If the object doesn’t exist in the cache, cache.get() returns None:

# Wait 30 seconds for ’my_key’ to expire...

>>> cache.get(’my_key’)

None

We advise against storing the literal value None in the cache, because you won’t be able to distinguish between your

stored None value and a cache miss signiﬁed by a return value of None.

cache.get() can take a default argument. This speciﬁes which value to return if the object doesn’t exist in the

cache:

>>> cache.get(’my_key’,’has expired’)

’has expired’

To add a key only if it doesn’t already exist, use the add() method. It takes the same parameters as set(), but it

will not attempt to update the cache if the key speciﬁed is already present:

17.5. The Low-Level Cache API 241

The Django Book, Release 2.0

>>> cache.set(’add_key’,’Initial value’)

>>> cache.add(’add_key’,’New value’)

>>> cache.get(’add_key’)

’Initial value’

If you need to know whether add() stored a value in the cache, you can check the return value. It will return True

if the value was stored, False otherwise.

There’s also a get_many() interface that only hits the cache once. get_many() returns a dictionary with all the

keys you asked for that actually exist in the cache (and haven’t expired):

>>> cache.set(’a’,1)

>>> cache.set(’b’,2)

>>> cache.set(’c’,3)

>>> cache.get_many([’a’,’b’,’c’])

{’a’: 1, ’b’: 2, ’c’: 3}

Finally, you can delete keys explicitly with delete(). This is an easy way of clearing the cache for a particular

object:

>>> cache.delete(’a’)

You can also increment or decrement a key that already exists using the incr() or decr() methods, respectively.

By default, the existing cache value will incremented or decremented by 1. Other increment/decrement values can

be speciﬁed by providing an argument to the increment/decrement call. A ValueError will be raised if you attempt to

increment or decrement a nonexistent cache key.:

>>> cache.set(’num’,1)

>>> cache.incr(’num’)

>>> cache.incr(’num’,10)

>>> cache.decr(’num’)

>>> cache.decr(’num’,5)

Note: incr()/decr() methods are not guaranteed to be atomic. On those backends that support atomic in-

crement/decrement (most notably, the memcached backend), increment and decrement operations will be atomic.

However, if the backend doesn’t natively provide an increment/decrement operation, it will be implemented using a

two-step retrieve/update.

17.6 Upstream Caches

So far, this chapter has focused on caching your own data. But another type of caching is relevant to Web development,

too: caching performed by “upstream” caches. These are systems that cache pages for users even before the request

reaches your Web site.

Here are a few examples of upstream caches:

• Your ISP may cache certain pages, so if you requested a page from http://example.com/, your ISP would send

you the page without having to access example.com directly. The maintainers of example.com have no knowl-

edge of this caching; the ISP sits between example.com and your Web browser, handling all of the caching

transparently.

242 Chapter 17. Chapter 15: Caching

The Django Book, Release 2.0

• Your Django Web site may sit behind a proxy cache, such as Squid Web Proxy Cache (http://www.squid-

cache.org/), that caches pages for performance. In this case, each request ﬁrst would be handled by the proxy,

and it would be passed to your application only if needed.

• Your Web browser caches pages, too. If a Web page sends out the appropriate headers, your browser will use

the local cached copy for subsequent requests to that page, without even contacting the Web page again to see

whether it has changed.

Upstream caching is a nice efﬁciency boost, but there’s a danger to it: Many Web pages’ contents differ based on

authentication and a host of other variables, and cache systems that blindly save pages based purely on URLs could

expose incorrect or sensitive data to subsequent visitors to those pages.

For example, say you operate a Web e-mail system, and the contents of the “inbox” page obviously depend on which

user is logged in. If an ISP blindly cached your site, then the ﬁrst user who logged in through that ISP would have his

user-speciﬁc inbox page cached for subsequent visitors to the site. That’s not cool.

Fortunately, HTTP provides a solution to this problem. A number of HTTP headers exist to instruct upstream caches to

differ their cache contents depending on designated variables, and to tell caching mechanisms not to cache particular

pages. We’ll look at some of these headers in the sections that follow.

17.7 Using Vary Headers

The Vary header deﬁnes which request headers a cache mechanism should take into account when building its cache

key. For example, if the contents of a Web page depend on a user’s language preference, the page is said to “vary on

language.”

By default, Django’s cache system creates its cache keys using the requested path (e.g.,

"/stories/2005/jun/23/bank_robbed/"). This means every request to that URL will use the same

cached version, regardless of user-agent differences such as cookies or language preferences. However, if this page

produces different content based on some difference in request headers – such as a cookie, or a language, or a

user-agent – you’ll need to use the Vary header to tell caching mechanisms that the page output depends on those

things.

To do this in Django, use the convenient vary_on_headers view decorator, like so:

from django.views.decorators.vary import vary_on_headers

# Python 2.3 syntax.

def my_view(request):

# ...

my_view = vary_on_headers(my_view, ’User-Agent’)

# Python 2.4+ decorator syntax.

@vary_on_headers(’User-Agent’)

def my_view(request):

# ...

In this case, a caching mechanism (such as Django’s own cache middleware) will cache a separate version of the page

for each unique user-agent.

The advantage to using the vary_on_headers decorator rather than manually setting the Vary header (using

something like response[’Vary’] = ’user-agent’) is that the decorator adds to the Vary header (which

may already exist), rather than setting it from scratch and potentially overriding anything that was already in there.

You can pass multiple headers to vary_on_headers():

17.7. Using Vary Headers 243

The Django Book, Release 2.0

@vary_on_headers(’User-Agent’, ’Cookie’)

def my_view(request):

# ...

This tells upstream caches to vary on both, which means each combination of user-agent and cookie will get its own

cache value. For example, a request with the user-agent Mozilla and the cookie value foo=bar will be considered

different from a request with the user-agent Mozilla and the cookie value foo=ham.

Because varying on cookie is so common, there’s a vary_on_cookie decorator. These two views are equivalent:

@vary_on_cookie

def my_view(request):

# ...

@vary_on_headers(’Cookie’)

def my_view(request):

# ...

The headers you pass to vary_on_headers are not case sensitive; "User-Agent" is the same thing as

"user-agent".

You can also use a helper function, django.utils.cache.patch_vary_headers, directly. This function

sets, or adds to, the Vary header. For example:

from django.utils.cache import patch_vary_headers

def my_view(request):

# ...

response =render(request, ’template_name’, context)

patch_vary_headers(response, [’Cookie’])

return response

patch_vary_headers takes an HttpResponse instance as its ﬁrst argument and a list/tuple of case-insensitive

header names as its second argument.

17.8 Controlling Cache: Using Other Headers

Other problems with caching are the privacy of data and the question of where data should be stored in a cascade of

caches.

A user usually faces two kinds of caches: his or her own browser cache (a private cache) and his or her provider’s

cache (a public cache). A public cache is used by multiple users and controlled by someone else. This poses problems

with sensitive data–you don’t want, say, your bank account number stored in a public cache. So Web applications need

a way to tell caches which data is private and which is public.

The solution is to indicate a page’s cache should be “private.” To do this in Django, use the cache_control view

decorator. Example:

from django.views.decorators.cache import cache_control

@cache_control(private=True)

def my_view(request):

# ...

This decorator takes care of sending out the appropriate HTTP header behind the scenes.

There are a few other ways to control cache parameters. For example, HTTP allows applications to do the following:

• Deﬁne the maximum time a page should be cached.

244 Chapter 17. Chapter 15: Caching

The Django Book, Release 2.0

• Specify whether a cache should always check for newer versions, only delivering the cached content when there

are no changes. (Some caches might deliver cached content even if the server page changed, simply because the

cache copy isn’t yet expired.)

In Django, use the cache_control view decorator to specify these cache parameters. In this example,

cache_control tells caches to revalidate the cache on every access and to store cached versions for, at most,

3,600 seconds:

from django.views.decorators.cache import cache_control

@cache_control(must_revalidate=True, max_age=3600)

def my_view(request):

# ...

Any valid Cache-Control HTTP directive is valid in cache_control(). Here’s a full list:

•public=True

•private=True

•no_cache=True

•no_transform=True

•must_revalidate=True

•proxy_revalidate=True

•max_age=num_seconds

•s_maxage=num_seconds

(Note that the caching middleware already sets the cache header’s max-age with the value of the

CACHE_MIDDLEWARE_SETTINGS setting. If you use a custom max_age in a cache_control decorator, the

decorator will take precedence, and the header values will be merged correctly.)

If you want to use headers to disable caching altogether, django.views.decorators.cache.never_cache

is a view decorator that adds headers to ensure the response won’t be cached by browsers or other caches. Example:

from django.views.decorators.cache import never_cache

@never_cache

def myview(request):

# ...

17.9 Other Optimizations

Django comes with a few other pieces of middleware that can help optimize your apps’ performance:

•django.middleware.http.ConditionalGetMiddleware adds support for modern browsers to

conditionally GET responses based on the ETag and Last-Modified headers.

•django.middleware.gzip.GZipMiddleware compresses responses for all moderns browsers, saving

bandwidth and transfer time.

17.10 Order of MIDDLEWARE_CLASSES

If you use caching middleware, it’s important to put each half in the right place within the MIDDLEWARE_CLASSES

setting. That’s because the cache middleware needs to know which headers by which to vary the cache storage.

17.9. Other Optimizations 245

The Django Book, Release 2.0

Middleware always adds something to the Vary response header when it can.

UpdateCacheMiddleware runs during the response phase, where middleware is run in reverse order, so an item at

the top of the list runs last during the response phase. Thus, you need to make sure that UpdateCacheMiddleware

appears before any other middleware that might add something to the Vary header. The following middleware mod-

ules do so:

•SessionMiddleware adds Cookie

•GZipMiddleware adds Accept-Encoding

•LocaleMiddleware adds Accept-Language

FetchFromCacheMiddleware, on the other hand, runs during the request phase, where middleware is applied

ﬁrst-to-last, so an item at the top of the list runs ﬁrst during the request phase. The FetchFromCacheMiddleware

also needs to run after other middleware updates the Vary header, so FetchFromCacheMiddleware must be

after any item that does so.

17.11 What’s Next?

Django ships with a number of “contrib” packages – optional features that can make your life easier. We’ve already

covered a few of these: the admin site (Chapter 6) and the session/user framework (Chapter 14). The next chapter

covers more of the “contributed” subframeworks.

246 Chapter 17. Chapter 15: Caching

CHAPTER 18

Chapter 16: django.contrib

One of the many strengths of Python is its “batteries included” philosophy: when you install Python, it comes with

a large standard library of packages that you can start using immediately, without having to download anything else.

Django aims to follow this philosophy, and it includes its own standard library of add-ons useful for common Web

development tasks. This chapter covers that collection of add-ons.

18.1 The Django Standard Library

Django’s standard library lives in the package django.contrib. Within each subpackage is a separate piece of

add-on functionality. These pieces are not necessarily related, but some django.contrib subpackages may require

other ones.

There’s no hard requirement for the types of functionality in django.contrib. Some of the packages include mod-

els (and hence require you to install their database tables into your database), but others consist solely of middleware

or template tags.

The single characteristic the django.contrib packages have in common is this: if you were to remove the

django.contrib package entirely, you could still use Django’s fundamental features with no problems. When

the Django developers add new functionality to the framework, they use this rule of thumb in deciding whether the

new functionality should live in django.contrib or elsewhere.

django.contrib consists of these packages:

•admin: The Django admin site. See Chapter 6.

•admindocs: Auto-documentation for the Django admin site. This book doesn’t cover this feature; check the

ofﬁcial Django documentation.

•auth: Django’s authentication framework. See Chapter 14.

•comments: A comments application. This book doesn’t cover this feature; check the ofﬁcial Django docu-

mentation.

•contenttypes: A framework for hooking into “types” of content, where each installed Django model is a

separate content type. This framework is used internally by other “contrib” applications and is mostly intended

for very advanced Django developers. Those developers should ﬁnd out more about this application by reading

the source code in django/contrib/contenttypes.

•csrf: Protection against Cross-Site Request Forgery (CSRF). See the later section titled “CSRF Protection.”

•databrowse: A Django application that lets you browse your data. This book doesn’t cover this feature;

check the ofﬁcial Django documentation.

247

The Django Book, Release 2.0

•flatpages: A framework for managing simple “ﬂat” HTML content in a database. See the later section titled

“Flatpages.”

•formtools: A number of useful higher-level libraries for dealing with common patterns in forms. This book

doesn’t cover this feature; check the ofﬁcial Django documentation.

•gis: Extensions to Django that provide for GIS (Geographic Information Systems) support. These, for ex-

ample, allow your Django models to store geographic data and perform geographic queries. This is a large,

complex library and isn’t covered in this book. See http://geodjango.org/ for documentation.

•humanize: A set of Django template ﬁlters useful for adding a “human touch” to data. See the later section

titled “Humanizing Data.”

•localflavor: Assorted pieces of code that are useful for particular countries or cultures. For example, this

includes ways to validate U.S. ZIP codes or Icelandic identiﬁcation numbers.

•markup: A set of Django template ﬁlters that implement a number of common markup languages. See the later

section titled “Markup Filters.”

•redirects: A framework for managing redirects. See the later section titled “Redirects.”

•sessions: Django’s session framework. See Chapter 14.

•sitemaps: A framework for generating sitemap XML ﬁles. See Chapter 13.

•sites: A framework that lets you operate multiple Web sites from the same database and Django installation.

See the next section, “Sites.”

•syndication: A framework for generating syndication feeds in RSS and Atom. See Chapter 13.

•webdesign: Django add-ons that are particularly useful to Web designers (as opposed to developers). As of

this writing, this included only a single template tag, {% lorem %}. Check the Django documentation for

information.

The rest of this chapter goes into detail a number of django.contrib packages that we haven’t yet covered in this

book.

18.2 Sites

Django’s sites system is a generic framework that lets you operate multiple Web sites from the same database and

Django project. This is an abstract concept, and it can be tricky to understand, so we’ll start with a couple of scenarios

where it would be useful.

18.2.1 Scenario 1: Reusing Data on Multiple Sites

As we explained in Chapter 1, the Django-powered sites LJWorld.com and Lawrence.com are operated by the same

news organization: the Lawrence Journal-World newspaper in Lawrence, Kansas. LJWorld.com focuses on news,

while Lawrence.com focuses on local entertainment. But sometimes editors want to publish an article on both sites.

The brain-dead way of solving the problem would be to use a separate database for each site and to require site

producers to publish the same story twice: once for LJWorld.com and again for Lawrence.com. But that’s inefﬁcient

for site producers, and it’s redundant to store multiple copies of the same story in the database.

The better solution? Both sites use the same article database, and an article is associated with one or more sites via a

many-to-many relationship. The Django sites framework provides the database table to which articles can be related.

It’s a hook for associating data with one or more “sites.”

248 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

18.2.2 Scenario 2: Storing Your Site Name/Domain in One Place

LJWorld.com and Lawrence.com both have e-mail alert functionality, which lets readers sign up to get notiﬁcations

when news happens. It’s pretty basic: a reader signs up on a Web form, and he immediately gets an e-mail saying,

“Thanks for your subscription.”

It would be inefﬁcient and redundant to implement this signup-processing code twice, so the sites use the same code

behind the scenes. But the “Thank you for your subscription” notice needs to be different for each site. By using Site

objects, we can abstract the thank-you notice to use the values of the current site’s name (e.g., ’LJWorld.com’)

and domain (e.g., ’www.ljworld.com’).

The Django sites framework provides a place for you to store the name and domain for each site in your Django

project, which means you can reuse those values in a generic way.

18.2.3 How to Use the Sites Framework

The sites framework is more a series of conventions than a framework. The whole thing is based on two simple

concepts:

• The Site model, found in django.contrib.sites, has domain and name ﬁelds.

• The SITE_ID setting speciﬁes the database ID of the Site object associated with that particular settings ﬁle.

How you use these two concepts is up to you, but Django uses them in a couple of ways automatically via simple

conventions.

To install the sites application, follow these steps:

1. Add ’django.contrib.sites’ to your INSTALLED_APPS.

2. Run the command manage.py syncdb to install the django_site table into your database. This will

also create a default site object, with the domain example.com.

3. Change the example.com site to your own domain, and add any other Site objects, either through the

Django admin site or via the Python API. Create a Site object for each site/domain that this Django project

powers.

4. Deﬁne the SITE_ID setting in each of your settings ﬁles. This value should be the database ID of the Site

object for the site powered by that settings ﬁle.

18.2.4 The Sites Framework’s Capabilities

The sections that follow describe the various things you can do with the sites framework.

Reusing Data on Multiple Sites

To reuse data on multiple sites, as explained in the ﬁrst scenario, just create a ManyToManyField to Site in your

models, for example:

from django.db import models

from django.contrib.sites.models import Site

class Article(models.Model):

headline =models.CharField(max_length=200)

# ...

sites =models.ManyToManyField(Site)

18.2. Sites 249

The Django Book, Release 2.0

That’s the infrastructure you need to associate articles with multiple sites in your database. With that in place, you

can reuse the same Django view code for multiple sites. Continuing the Article model example, here’s what an

article_detail view might look like:

from django.conf import settings

from django.shortcuts import get_object_or_404

from mysite.articles.models import Article

def article_detail(request, article_id):

a=get_object_or_404(Article, id=article_id, sites__id=settings.SITE_ID)

# ...

This view function is reusable because it checks the article’s site dynamically, according to the value of the SITE_ID

setting.

For example, say LJWorld.com’s settings ﬁle has a SITE_ID set to 1, and Lawrence.com’s settings ﬁle has a

SITE_ID set to 2. If this view is called when LJWorld.com’s settings ﬁle is active, then it will limit the article

lookup to articles in which the list of sites includes LJWorld.com.

Associating Content with a Single Site

Similarly, you can associate a model to the Site model in a many-to-one relationship using ForeignKey.

For example, if each article is associated with only a single site, you could use a model like this:

from django.db import models

from django.contrib.sites.models import Site

class Article(models.Model):

headline =models.CharField(max_length=200)

# ...

site =models.ForeignKey(Site)

This has the same beneﬁts as described in the last section.

Hooking Into the Current Site from Views

On a lower level, you can use the sites framework in your Django views to do particular things based on the site in

which the view is being called, for example:

from django.conf import settings

def my_view(request):

if settings.SITE_ID == 3:

# Do something.

else:

# Do something else.

Of course, it’s ugly to hard-code the site IDs like that. A slightly cleaner way of accomplishing the same thing is to

check the current site’s domain:

from django.conf import settings

from django.contrib.sites.models import Site

def my_view(request):

current_site = Site.objects.get(id=settings.SITE_ID)

if current_site.domain == ’foo.com’:

# Do something

250 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

else:

# Do something else.

The idiom of retrieving the Site object for the value of settings.SITE_ID is quite common, so the Site

model’s manager (Site.objects) has a get_current() method. This example is equivalent to the previous

one:

from django.contrib.sites.models import Site

def my_view(request):

current_site = Site.objects.get_current()

if current_site.domain == ’foo.com’:

# Do something

else:

# Do something else.

Note: In this ﬁnal example, you don’t have to import django.conf.settings.

Getting the Current Domain for Display

For a DRY (Don’t Repeat Yourself) approach to storing your site’s name and domain name, as explained in “Scenario

2: Storing Your Site Name/Domain in One Place,” just reference the name and domain of the current Site object.

For example:

from django.contrib.sites.models import Site

from django.core.mail import send_mail

def register_for_newsletter(request):

# Check form values, etc., and subscribe the user.

# ...

current_site =Site.objects.get_current()

send_mail(’Thanks for subscribing to %s alerts’ %current_site.name,

’Thanks for your subscription. We appreciate it.\n\n-The %s team.’ %current_site.name,

’editor@%s’%current_site.domain,

[user_email])

# ...

Continuing our ongoing example of LJWorld.com and Lawrence.com, on Lawrence.com this e-mail has the subject

line “Thanks for subscribing to lawrence.com alerts.” On LJWorld.com, the e-mail has the subject line “Thanks for

subscribing to LJWorld.com alerts.” This same site-speciﬁc behavior is applied to the e-mails’ message body.

An even more ﬂexible (but more heavyweight) way of doing this would be to use Django’s template system. Assuming

Lawrence.com and LJWorld.com have different template directories (TEMPLATE_DIRS), you could simply delegate

to the template system like so:

from django.core.mail import send_mail

from django.template import loader, Context

def register_for_newsletter(request):

# Check form values, etc., and subscribe the user.

# ...

subject =loader.get_template(’alerts/subject.txt’).render(Context({}))

message =loader.get_template(’alerts/message.txt’).render(Context({}))

send_mail(subject, message, ’do-not-reply@example.com’, [user_email])

# ...

18.2. Sites 251

The Django Book, Release 2.0

In this case, you have to create subject.txt and message.txt templates in both the LJWorld.com and

Lawrence.com template directories. As mentioned previously, that gives you more ﬂexibility, but it’s also more com-

plex.

It’s a good idea to exploit the Site objects as much as possible to remove unneeded complexity and redundancy.

18.2.5 CurrentSiteManager

If Site objects play a key role in your application, consider using the CurrentSiteManager in your model(s).

It’s a model manager (see Chapter 10) that automatically ﬁlters its queries to include only objects associated with the

current Site.

Use CurrentSiteManager by adding it to your model explicitly. For example:

from django.db import models

from django.contrib.sites.models import Site

from django.contrib.sites.managers import CurrentSiteManager

class Photo(models.Model):

photo =models.FileField(upload_to=’/home/photos’)

photographer_name =models.CharField(max_length=100)

pub_date =models.DateField()

site =models.ForeignKey(Site)

objects =models.Manager()

on_site =CurrentSiteManager()

With this model, Photo.objects.all() will return all Photo objects in the database, but

Photo.on_site.all() will return only the Photo objects associated with the current site, according to

the SITE_ID setting.

In other words, these two statements are equivalent:

Photo.objects.filter(site=settings.SITE_ID)

Photo.on_site.all()

How did CurrentSiteManager know which ﬁeld of Photo was the Site? It defaults to looking for a ﬁeld

called site. If your model has a ForeignKey or ManyToManyField called something other than site, you

need to explicitly pass that as the parameter to CurrentSiteManager. The following model, which has a ﬁeld

called publish_on, demonstrates this:

from django.db import models

from django.contrib.sites.models import Site

from django.contrib.sites.managers import CurrentSiteManager

class Photo(models.Model):

photo =models.FileField(upload_to=’/home/photos’)

photographer_name =models.CharField(max_length=100)

pub_date =models.DateField()

publish_on =models.ForeignKey(Site)

objects =models.Manager()

on_site =CurrentSiteManager(’publish_on’)

If you attempt to use CurrentSiteManager and pass a ﬁeld name that doesn’t exist, Django will raise a

ValueError.

Note: You’ll probably want to keep a normal (non-site-speciﬁc) Manager on your model, even if you use

CurrentSiteManager. As explained in Appendix B, if you deﬁne a manager manually, then Django won’t create

the automatic objects = models.Manager() manager for you.

252 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

Also, certain parts of Django – namely, the Django admin site and generic views – use whichever manager is deﬁned

ﬁrst in the model, so if you want your admin site to have access to all objects (not just site-speciﬁc ones), put objects

= models.Manager() in your model, before you deﬁne CurrentSiteManager.

18.2.6 How Django Uses the Sites Framework

Although it’s not required that you use the sites framework, it’s encouraged, because Django takes advantage of it in

a few places. Even if your Django installation is powering only a single site, you should take a few seconds to create

the site object with your domain and name, and point to its ID in your SITE_ID setting.

Here’s how Django uses the sites framework:

• In the redirects framework (see the later section “Redirects”), each redirect object is associated with a particular

site. When Django searches for a redirect, it takes into account the current SITE_ID.

• In the comments framework, each comment is associated with a particular site. When a comment is posted, its

site is set to the current SITE_ID, and when comments are listed via the appropriate template tag, only the

comments for the current site are displayed.

• In the ﬂatpages framework (see the later section “Flatpages”), each ﬂatpage is associated with a particular site.

When a ﬂatpage is created, you specify its site, and the ﬂatpage middleware checks the current SITE_ID in

retrieving ﬂatpages to display.

• In the syndication framework (see Chapter 13), the templates for title and description automatically

have access to a variable {{ site }}, which is the Site object representing the current site. Also, the

hook for providing item URLs will use the domain from the current Site object if you don’t specify a fully

qualiﬁed domain.

• In the authentication framework (see Chapter 14), the django.contrib.auth.views.login view

passes the current Site name to the template as {{ site_name }} and the current Site object as {{

site }}.

18.3 Flatpages

Often you’ll have a database-driven Web application up and running, but you’ll need to add a couple of one-off static

pages, such as an About page or a Privacy Policy page. It would be possible to use a standard Web server such as

Apache to serve these ﬁles as ﬂat HTML ﬁles, but that introduces an extra level of complexity into your application,

because then you have to worry about conﬁguring Apache, you have to set up access for your team to edit those ﬁles,

and you can’t take advantage of Django’s template system to style the pages.

The solution to this problem is Django’s ﬂatpages application, which lives in the package

django.contrib.flatpages. This application lets you manage such one-off pages via Django’s admin

site, and it lets you specify templates for them using Django’s template system. It uses Django models behind the

scenes, which means it stores the pages in a database, just like the rest of your data, and you can access ﬂatpages with

the standard Django database API.

Flatpages are keyed by their URL and site. When you create a ﬂatpage, you specify which URL it’s associated with,

along with which site(s) it’s on. (For more on sites, see the “Sites” section.)

18.3.1 Using Flatpages

To install the ﬂatpages application, follow these steps:

18.3. Flatpages 253

The Django Book, Release 2.0

1. Add ’django.contrib.flatpages’ to your INSTALLED_APPS.django.contrib.flatpages

depends on django.contrib.sites, so make sure the both packages are in INSTALLED_APPS.

2. Add ’django.contrib.flatpages.middleware.FlatpageFallbackMiddleware’ to your

MIDDLEWARE_CLASSES setting.

3. Run the command manage.py syncdb to install the two required tables into your database.

The ﬂatpages application creates two tables in your database: django_flatpage and

django_flatpage_sites.django_flatpage simply maps a URL to a title and bunch of text con-

tent. django_flatpage_sites is a many-to-many table that associates a ﬂatpage with one or more sites.

The application comes with a single FlatPage model, deﬁned in django/contrib/flatpages/models.py.

It looks something like this:

from django.db import models

from django.contrib.sites.models import Site

class FlatPage(models.Model):

url =models.CharField(max_length=100, db_index=True)

title =models.CharField(max_length=200)

content =models.TextField(blank=True)

enable_comments =models.BooleanField()

template_name =models.CharField(max_length=70, blank=True)

registration_required =models.BooleanField()

sites =models.ManyToManyField(Site)

Let’s examine these ﬁelds one at a time:

•url: The URL at which this ﬂatpage lives, excluding the domain name but including the leading slash (e.g.,

/about/contact/).

•title: The title of the ﬂatpage. The framework doesn’t do anything special with this. It’s your responsibility

to display it in your template.

•content: The content of the ﬂatpage (i.e., the HTML of the page). The framework doesn’t do anything special

with this. It’s your responsibility to display it in the template.

•enable_comments: Whether to enable comments on this ﬂatpage. The framework doesn’t do anything

special with this. You can check this value in your template and display a comment form if needed.

•template_name: The name of the template to use for rendering this ﬂatpage. This is optional; if it’s not given

or if this template doesn’t exist, the framework will fall back to the template flatpages/default.html.

•registration_required: Whether registration is required for viewing this ﬂatpage. This integrates with

Django’s authentication/user framework, which is explained further in Chapter 14.

•sites: The sites that this ﬂatpage lives on. This integrates with Django’s sites framework, which is explained

in the “Sites” section of this chapter.

You can create ﬂatpages through either the Django admin interface or the Django database API. For more information

on this, see the section “Adding, Changing, and Deleting Flatpages.”

Once you’ve created ﬂatpages, FlatpageFallbackMiddleware does all of the work. Each time any Django

application raises a 404 error, this middleware checks the ﬂatpages database for the requested URL as a last resort.

Speciﬁcally, it checks for a ﬂatpage with the given URL with a site ID that corresponds to the SITE_ID setting.

If it ﬁnds a match, it loads the ﬂatpage’s template or flatpages/default.html if the ﬂatpage has not speciﬁed

a custom template. It passes that template a single context variable, flatpage, which is the FlatPage object. It

uses RequestContext in rendering the template.

If FlatpageFallbackMiddleware doesn’t ﬁnd a match, the request continues to be processed as usual.

254 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

Note: This middleware only gets activated for 404 (page not found) errors – not for 500 (server error) or

other error responses. Also note that the order of MIDDLEWARE_CLASSES matters. Generally, you can put

FlatpageFallbackMiddleware at or near the end of the list, because it’s a last resort.

18.3.2 Adding, Changing, and Deleting Flatpages

You can add, change and delete ﬂatpages in two ways:

Via the Admin Interface

If you’ve activated the automatic Django admin interface, you should see a “Flatpages” section on the admin index

page. Edit ﬂatpages as you would edit any other object in the system.

Via the Python API

As described previously, ﬂatpages are represented by a standard Django model that lives in

django/contrib/flatpages/models.py. Hence, you can access ﬂatpage objects via the Django

database API, for example:

>>> from django.contrib.flatpages.models import FlatPage

>>> from django.contrib.sites.models import Site

>>> fp =FlatPage.objects.create(

... url=’/about/’,

... title=’About’,

... content=’<p>About this site...</p>’,

... enable_comments=False,

... template_name=’’,

... registration_required=False,

... )

>>> fp.sites.add(Site.objects.get(id=1))

>>> FlatPage.objects.get(url=’/about/’)

<FlatPage: /about/ -- About>

18.3.3 Using Flatpage Templates

By default, ﬂatpages are rendered via the template flatpages/default.html, but you can override that for a

particular ﬂatpage with the template_name ﬁeld on the FlatPage object.

Creating the flatpages/default.html template is your responsibility. In your template directory, just create a

flatpages directory containing a default.html ﬁle.

Flatpage templates are passed a single context variable, flatpage, which is the ﬂatpage object.

Here’s a sample flatpages/default.html template:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"

"http://www.w3.org/TR/REC-html40/loose.dtd">

<html>

<head>

<title>{{ flatpage.title }}</title>

</head>

<body>

18.3. Flatpages 255

The Django Book, Release 2.0

</body>

</html>

Note that we’ve used the safe template ﬁlter to allow flatpage.content to include raw HTML and bypass

auto-escaping.

18.4 Redirects

Django’s redirects framework lets you manage redirects easily by storing them in a database and treating them as any

other Django model object. For example, you can use the redirects framework to tell Django, “Redirect any request

to /music/ to /sections/arts/music/.” This comes in handy when you need to move things around on your

site; Web developers should do whatever is necessary to avoid broken links.

18.4.1 Using the Redirects Framework

To install the redirects application, follow these steps:

1. Add ’django.contrib.redirects’ to your INSTALLED_APPS.

2. Add ’django.contrib.redirects.middleware.RedirectFallbackMiddleware’ to your

MIDDLEWARE_CLASSES setting.

3. Run the command manage.py syncdb to install the single required table into your database.

manage.py syncdb creates a django_redirect table in your database. This is a simple lookup table with

site_id,old_path, and new_path ﬁelds.

You can create redirects through either the Django admin interface or the Django database API. For more, see the

section “Adding, Changing, and Deleting Redirects.”

Once you’ve created redirects, the RedirectFallbackMiddleware class does all of the work. Each time any

Django application raises a 404 error, this middleware checks the redirects database for the requested URL as a last

resort. Speciﬁcally, it checks for a redirect with the given old_path with a site ID that corresponds to the SITE_ID

setting. (See the earlier section “Sites” for more information on SITE_ID and the sites framework.) Then it follows

these steps:

• If it ﬁnds a match, and new_path is not empty, it redirects to new_path.

• If it ﬁnds a match, and new_path is empty, it sends a 410 (“Gone”) HTTP header and an empty (contentless)

response.

• If it doesn’t ﬁnd a match, the request continues to be processed as usual.

The middleware only gets activated for 404 errors – not for 500 errors or responses of any other status code.

Note that the order of MIDDLEWARE_CLASSES matters. Generally, you can put

RedirectFallbackMiddleware toward the end of the list, because it’s a last resort.

Note: If you’re using both the redirect and ﬂatpage fallback middleware, consider which one (redirect or ﬂatpage)

you’d like checked ﬁrst. We suggest ﬂatpages before redirects (thus putting the ﬂatpage middleware before the redirect

middleware), but you might feel differently.

18.4.2 Adding, Changing, and Deleting Redirects

You can add, change and delete redirects in two ways:

256 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

Via the Admin Interface

If you’ve activated the automatic Django admin interface, you should see a “Redirects” section on the admin index

page. Edit redirects as you would edit any other object in the system.

Via the Python API

Redirects are represented by a standard Django model that lives in django/contrib/redirects/models.py.

Hence, you can access redirect objects via the Django database API, for example:

>>> from django.contrib.redirects.models import Redirect

>>> from django.contrib.sites.models import Site

>>> red =Redirect.objects.create(

... site=Site.objects.get(id=1),

... old_path=’/music/’,

... new_path=’/sections/arts/music/’,

... )

>>> Redirect.objects.get(old_path=’/music/’)

<Redirect: /music/ ---> /sections/arts/music/>

18.5 CSRF Protection

The django.contrib.csrf package protects against Cross-Site Request Forgery (CSRF).

CSRF, also known as “session riding,” is a Web site security exploit. It happens when a malicious Web site tricks a

user into unknowingly loading a URL from a site at which that user is already authenticated, hence taking advantage

of the user’s authenticated status. This can be a bit tricky to understand at ﬁrst, so we walk through two examples in

this section.

18.5.1 A Simple CSRF Example

Suppose you’re logged in to a webmail account at example.com. This webmail site has a Log Out button that points

to the URL example.com/logout – that is, the only action you need to take in order to log out is to visit the page

example.com/logout.

A malicious site can coerce you to visit the URL example.com/logout by including that URL as a hidden

the malicious page that has an <iframe> to example.com/logout, the act of visiting the malicious page will

log you out from example.com.

Clearly, being logged out of a webmail site against your will is not a terrifying breach of security, but this same type of

exploit can happen to any site that trusts users, such as an online banking site or an e-commerce site, where the exploit

could be used to initiate an order or payment without the user’s knowledge.

18.5.2 A More Complex CSRF Example

In the previous example, example.com was partially at fault because it allowed a state change (i.e., logging the user

out) to be requested via the HTTP GET method. It’s much better practice to require an HTTP POST for any request

that changes state on the server. But even Web sites that require POST for state-changing actions are vulnerable to

CSRF.

Suppose example.com has upgraded its Log Out functionality so that it’s a <form> button that is requested via

POST to the URL example.com/logout. Furthermore, the logout <form> includes this hidden ﬁeld:

18.5. CSRF Protection 257

The Django Book, Release 2.0

This ensures that a simple POST to the URL example.com/logout won’t log a user out; in order for a user to log

out, the user must request example.com/logout via POST and send the confirm POST variable with a value

of ’true’.

Well, despite the extra security, this arrangement can still be exploited by CSRF – the malicious page just needs to do

a little more work. Attackers can create an entire form targeting your site, hide it in an invisible <iframe>, and then

use JavaScript to submit that form automatically.

18.5.3 Preventing CSRF

How, then, can your site protect itself from this exploit? The ﬁrst step is to make sure all GET requests are free of side

effects. That way, if a malicious site includes one of your pages as an <iframe>, it won’t have a negative effect.

That leaves POST requests. The second step is to give each POST <form> a hidden ﬁeld whose value is secret and

is generated from the user’s session ID. Then, when processing the form on the server side, check for that secret ﬁeld

and raise an error if it doesn’t validate.

This is exactly what Django’s CSRF prevention layer does, as explained in the sections that follow.

Using the CSRF Middleware

The django.contrib.csrf package contains only one module: middleware.py. This module contains a

Django middleware class, CsrfMiddleware, which implements the CSRF protection.

To activate this CSRF protection, add ’django.contrib.csrf.middleware.CsrfMiddleware’ to the

MIDDLEWARE_CLASSES setting in your settings ﬁle. This middleware needs to process the response after

SessionMiddleware, so CsrfMiddleware must appear before SessionMiddleware in the list (because

the response middleware is processed last-to-ﬁrst). Also, it must process the response before the response gets com-

pressed or otherwise mangled, so CsrfMiddleware must come after GZipMiddleware. Once you’ve added that

to your MIDDLEWARE_CLASSES setting, you’re done. See the section “Order of MIDDLEWARE_CLASSES” in

Chapter 15 for more explanation.

In case you’re interested, here’s how CsrfMiddleware works. It does these two things:

1. It modiﬁes outgoing requests by adding a hidden form ﬁeld to all POST forms, with the name

csrfmiddlewaretoken and a value that is a hash of the session ID plus a secret key. The middleware

does not modify the response if there’s no session ID set, so the performance penalty is negligible for requests

that don’t use sessions.

2. On all incoming POST requests that have the session cookie set, it checks that csrfmiddlewaretoken is

present and correct. If it isn’t, the user will get a 403 HTTP error. The content of the 403 error page is the

message “Cross Site Request Forgery detected. Request aborted.”

This ensures that only forms originating from your Web site can be used to POST data back.

This middleware deliberately targets only HTTP POST requests (and the corresponding POST forms). As we ex-

plained, GET requests ought never to have side effects; it’s your own responsibility to ensure this.

POST requests not accompanied by a session cookie are not protected, but they don’t need to be protected, because a

malicious Web site could make these kind of requests anyway.

To avoid altering non-HTML requests, the middleware checks the response’s Content-Type header before modi-

fying it. Only pages that are served as text/html or application/xml+xhtml are modiﬁed.

258 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

Limitations of the CSRF Middleware

CsrfMiddleware requires Django’s session framework to work. (See Chapter 14 for more on sessions.) If you’re

using a custom session or authentication framework that manually manages session cookies, this middleware will not

help you.

If your application creates HTML pages and forms in some unusual way (e.g., if it sends fragments of HTML in

JavaScript document.write statements), you might bypass the ﬁlter that adds the hidden ﬁeld to the form. In this

case, the form submission will always fail. (This happens because CsrfMiddleware uses a regular expression to

add the csrfmiddlewaretoken ﬁeld to your HTML before the page is sent to the client, and the regular expression

sometimes cannot handle wacky HTML.) If you suspect this might be happening, just view the source in your Web

browser to see whether csrfmiddlewaretoken was inserted into your <form>.

For more CSRF information and examples, visit http://en.wikipedia.org/wiki/CSRF

18.6 Humanizing Data

The package django.contrib.humanize holds a set of Django template ﬁlters useful for adding a “human

touch” to data. To activate these ﬁlters, add ’django.contrib.humanize’ to your INSTALLED_APPS. Once

you’ve done that, use {% load humanize %} in a template, and you’ll have access to the ﬁlters described in the

following sections.

18.6.1 apnumber

For numbers 1 through 9, this ﬁlter returns the number spelled out. Otherwise, it returns the numeral. This follows

Associated Press style.

Examples:

• 1 becomes “one”.

• 2 becomes “two”.

• 10 becomes “10”.

You can pass in either an integer or a string representation of an integer.

18.6.2 intcomma

This ﬁlter converts an integer to a string containing commas every three digits.

Examples:

• 4500 becomes “4,500”.

• 45000 becomes “45,000”.

• 450000 becomes “450,000”.

• 4500000 becomes “4,500,000”.

You can pass in either an integer or a string representation of an integer.

18.6. Humanizing Data 259

The Django Book, Release 2.0

18.6.3 intword

This ﬁlter converts a large integer to a friendly text representation. It works best for numbers over 1 million.

Examples:

• 1000000 becomes “1.0 million”.

• 1200000 becomes “1.2 million”.

• 1200000000 becomes “1.2 billion”.

Values up to 1 quadrillion (1,000,000,000,000,000) are supported.

You can pass in either an integer or a string representation of an integer.

18.6.4 ordinal

This ﬁlter converts an integer to its ordinal as a string.

Examples:

• 1 becomes “1st”.

• 2 becomes “2nd”.

• 3 becomes “3rd”.

• 254 becomes “254th”.

You can pass in either an integer or a string representation of an integer.

18.7 Markup Filters

The package django.contrib.markup includes a handful of Django template ﬁlters, each of which implements

a common markup languages:

•textile: Implements Textile (http://en.wikipedia.org/wiki/Textile_%28markup_language%29)

•markdown: Implements Markdown (http://en.wikipedia.org/wiki/Markdown)

•restructuredtext: Implements ReStructured Text (http://en.wikipedia.org/wiki/ReStructuredText)

In each case, the ﬁlter expects formatted markup as a string and returns a string representing the marked-up text. For

example, the textile ﬁlter converts text that is marked up in Textile format to HTML:

{% load markup %}

To activate these ﬁlters, add ’django.contrib.markup’ to your INSTALLED_APPS setting. Once you’ve

done that, use {% load markup %} in a template, and you’ll have access to these ﬁlters. For more documentation,

read the source code in django/contrib/markup/templatetags/markup.py.

18.8 What’s Next?

Many of these contributed frameworks (CSRF, the auth system, etc.) do their magic by providing a piece of middle-

ware. Middleware is code that runs before and/or after every request and can modify requests and responses at will, to

260 Chapter 18. Chapter 16: django.contrib

The Django Book, Release 2.0

extend the framework. In the next chapter, we’ll discuss Django’s built-in middleware and explain how you can write

your own.

18.8. What’s Next? 261

The Django Book, Release 2.0

262 Chapter 18. Chapter 16: django.contrib

CHAPTER 19

Chapter 17: Middleware

On occasion, you’ll need to run a piece of code on each and every request that Django handles. This code might need

to modify the request before the view handles it, it might need to log information about the request for debugging

purposes, and so forth.

You can do this with Django’s middleware framework, which is a set of hooks into Django’s request/response process-

ing. It’s a light, low-level “plug-in” system capable of globally altering both Django’s input and output.

Each middleware component is responsible for doing some speciﬁc function. If you’re reading this book straight

through, you’ve seen middleware a number of times already:

• All of the session and user tools that we looked at in Chapter 14 are made possible by a few small pieces of

middleware (more speciﬁcally, the middleware makes request.session and request.user available to

you in views).

• The sitewide cache discussed in Chapter 15 is actually just a piece of middleware that bypasses the call to your

view function if the response for that view has already been cached.

• The flatpages,redirects, and csrf applications from Chapter 16 all do their magic through middleware

components.

This chapter dives deeper into exactly what middleware is and how it works, and explains how you can write your own

middleware.

19.1 What’s Middleware?

Let’s start with a very simple example.

High-trafﬁc sites often need to deploy Django behind a load-balancing proxy (see Chapter 12). This can cause a few

small complications, one of which is that every request’s remote IP (request.META["REMOTE_IP"]) will be that

of the load balancer, not the actual IP making the request. Load balancers deal with this by setting a special header,

X-Forwarded-For, to the actual requesting IP address.

So here’s a small bit of middleware that lets sites running behind a proxy still see the correct IP address in

request.META["REMOTE_ADDR"]:

class SetRemoteAddrFromForwardedFor(object):

def process_request(self, request):

try:

real_ip =request.META[’HTTP_X_FORWARDED_FOR’]

except KeyError:

pass

else:

263

The Django Book, Release 2.0

# HTTP_X_FORWARDED_FOR can be a comma-separated list of IPs.

# Take just the first one.

real_ip =real_ip.split(",")[0]

request.META[’REMOTE_ADDR’]=real_ip

(Note: Although the HTTP header is called X-Forwarded-For, Django makes it available as

request.META[’HTTP_X_FORWARDED_FOR’]. With the exception of content-length and

content-type, any HTTP headers in the request are converted to request.META keys by converting all

characters to uppercase, replacing any hyphens with underscores and adding an HTTP_ preﬁx to the name.)

If this middleware is installed (see the next section), every request’s X-Forwarded-For value will be au-

tomatically inserted into request.META[’REMOTE_ADDR’]. This means your Django applications don’t

need to be concerned with whether they’re behind a load-balancing proxy or not; they can simply access

request.META[’REMOTE_ADDR’], and that will work whether or not a proxy is being used.

In fact, this is a common enough need that this piece of middleware is a built-in part of Django. It lives in

django.middleware.http, and you can read a bit more about it later in this chapter.

19.2 Middleware Installation

If you’ve read this book straight through, you’ve already seen a number of examples of middleware installation;

many of the examples in previous chapters have required certain middleware. For completeness, here’s how to install

middleware.

To activate a middleware component, add it to the MIDDLEWARE_CLASSES tuple in your settings module. In

MIDDLEWARE_CLASSES, each middleware component is represented by a string: the full Python path to the mid-

dleware’s class name. For example, here’s the default MIDDLEWARE_CLASSES created by django-admin.py

startproject:

MIDDLEWARE_CLASSES =(

’django.middleware.common.CommonMiddleware’,

’django.contrib.sessions.middleware.SessionMiddleware’,

’django.contrib.auth.middleware.AuthenticationMiddleware’,

)

A Django installation doesn’t require any middleware – MIDDLEWARE_CLASSES can be empty, if you’d like – but

we recommend that you activate CommonMiddleware, which we explain shortly.

The order is signiﬁcant. On the request and view phases, Django applies middleware in the order given in

MIDDLEWARE_CLASSES, and on the response and exception phases, Django applies middleware in reverse order.

That is, Django treats MIDDLEWARE_CLASSES as a sort of “wrapper” around the view function: on the request it

walks down the list to the view, and on the response it walks back up.

19.3 Middleware Methods

Now that you know what middleware is and how to install it, let’s take a look at all the available methods that middle-

ware classes can deﬁne.

19.3.1 Initializer: __init__(self)

Use __init__() to perform systemwide setup for a given middleware class.

264 Chapter 19. Chapter 17: Middleware

The Django Book, Release 2.0

For performance reasons, each activated middleware class is instantiated only once per server process. This means

that __init__() is called only once – at server startup – not for individual requests.

A common reason to implement an __init__() method is to check whether the middleware is indeed needed. If

__init__() raises django.core.exceptions.MiddlewareNotUsed, then Django will remove the mid-

dleware from the middleware stack. You might use this feature to check for some piece of software that the middleware

class requires, or check whether the server is running debug mode, or any other such environment situation.

If a middleware class deﬁnes an __init__() method, the method should take no arguments beyond the standard

self.

19.3.2 Request Preprocessor: process_request(self, request)

This method gets called as soon as the request has been received – before Django has parsed the URL to determine

which view to execute. It gets passed the HttpRequest object, which you may modify at will.

process_request() should return either None or an HttpResponse object.

• If it returns None, Django will continue processing this request, executing any other middleware and then the

appropriate view.

• If it returns an HttpResponse object, Django won’t bother calling any other middleware (of any type) or the

appropriate view. Django will immediately return that HttpResponse.

19.3.3 View Preprocessor: process_view(self, request, view, args, kwargs)

This method gets called after the request preprocessor is called and Django has determined which view to execute, but

before that view has actually been executed.

The arguments passed to this view are shown in Table 17-1.

Table 19.1: Table 17-1. Arguments Passed to process_view()

Argu-

ment

Explanation

request The HttpRequest object.

view The Python function that Django will call to handle this request. This is the actual function object

itself, not the name of the function as a string.

args The list of positional arguments that will be passed to the view, not including the request argument

(which is always the ﬁrst argument to a view).

kwargs The dictionary of keyword arguments that will be passed to the view.

Just like process_request(),process_view() should return either None or an HttpResponse object.

• If it returns None, Django will continue processing this request, executing any other middleware and then the

appropriate view.

• If it returns an HttpResponse object, Django won’t bother calling any other middleware (of any type) or the

appropriate view. Django will immediately return that HttpResponse.

19.3.4 Response Postprocessor: process_response(self, request, response)

This method gets called after the view function is called and the response is generated. Here, the processor can modify

the content of a response. One obvious use case is content compression, such as gzipping of the request’s HTML.

The parameters should be pretty self-explanatory: request is the request object, and response is the response

object returned from the view.

19.3. Middleware Methods 265

The Django Book, Release 2.0

Unlike the request and view preprocessors, which may return None,process_response() must return an

HttpResponse object. That response could be the original one passed into the function (possibly modiﬁed) or

a brand-new one.

19.3.5 Exception Postprocessor: process_exception(self, request, exception)

This method gets called only if something goes wrong and a view raises an uncaught exception. You can use this hook

to send error notiﬁcations, dump postmortem information to a log, or even try to recover from the error automatically.

The parameters to this function are the same request object we’ve been dealing with all along, and exception,

which is the actual Exception object raised by the view function.

process_exception() should return a either None or an HttpResponse object.

• If it returns None, Django will continue processing this request with the framework’s built-in exception han-

dling.

• If it returns an HttpResponse object, Django will use that response instead of the framework’s built-in

exception handling.

Note: Django ships with a number of middleware classes (discussed in the following section) that make good

examples. Reading the code for them should give you a good feel for the power of middleware.

You can also ﬁnd a number of community-contributed examples on Django’s wiki:

http://code.djangoproject.com/wiki/ContributedMiddleware

19.4 Built-in Middleware

Django comes with some built-in middleware to deal with common problems, which we discuss in the sections that

follow.

19.4.1 Authentication Support Middleware

Middleware class: django.contrib.auth.middleware.AuthenticationMiddleware.

This middleware enables authentication support. It adds the request.user attribute, representing the currently

logged-in user, to every incoming HttpRequest object.

See Chapter 14 for complete details.

19.4.2 “Common” Middleware

Middleware class: django.middleware.common.CommonMiddleware.

This middleware adds a few conveniences for perfectionists:

•Forbids access to user agents in the ‘‘DISALLOWED_USER_AGENTS‘‘ setting: If provided, this setting should

be a list of compiled regular expression objects that are matched against the user-agent header for each incoming

request. Here’s an example snippet from a settings ﬁle:

import re

DISALLOWED_USER_AGENTS =(

re.compile(r’^OmniExplorer_Bot’),

266 Chapter 19. Chapter 17: Middleware

The Django Book, Release 2.0

re.compile(r’^Googlebot’)

)

Note the import re, because DISALLOWED_USER_AGENTS requires its values to be compiled regexes

(i.e., the output of re.compile()). The settings ﬁle is regular Python, so it’s perfectly OK to include Python

import statements in it.

•Performs URL rewriting based on the ‘‘APPEND_SLASH‘‘ and ‘‘PREPEND_WWW‘‘ settings: If

APPEND_SLASH is True, URLs that lack a trailing slash will be redirected to the same URL with a trail-

ing slash, unless the last component in the path contains a period. So foo.com/bar is redirected to

foo.com/bar/, but foo.com/bar/file.txt is passed through unchanged.

If PREPEND_WWW is True, URLs that lack a leading “www.” will be redirected to the same URL with a leading

“www.”.

Both of these options are meant to normalize URLs. The philosophy is that each URL should exist in one – and

only one – place. Technically the URL example.com/bar is distinct from example.com/bar/, which

in turn is distinct from www.example.com/bar/. A search-engine indexer would treat these as separate

URLs, which is detrimental to your site’s search-engine rankings, so it’s a best practice to normalize URLs.

•Handles ETags based on the ‘‘USE_ETAGS‘‘ setting:ETags are an HTTP-level optimization for caching pages

conditionally. If USE_ETAGS is set to True, Django will calculate an ETag for each request by MD5-hashing

the page content, and it will take care of sending Not Modified responses, if appropriate.

Note there is also a conditional GET middleware, covered shortly, which handles ETags and does a bit more.

19.4.3 Compression Middleware

Middleware class: django.middleware.gzip.GZipMiddleware.

This middleware automatically compresses content for browsers that understand gzip compression (all modern

browsers). This can greatly reduce the amount of bandwidth a Web server consumes. The tradeoff is that it takes

a bit of processing time to compress pages.

We usually prefer speed over bandwidth, but if you prefer the reverse, just enable this middleware.

19.4.4 Conditional GET Middleware

Middleware class: django.middleware.http.ConditionalGetMiddleware.

This middleware provides support for conditional GET operations. If the response has an Last-Modified or ETag

or header, and the request has If-None-Match or If-Modified-Since, the response is replaced by an 304

(“Not modiﬁed”) response. ETag support depends on on the USE_ETAGS setting and expects the ETag response

header to already be set. As discussed above, the ETag header is set by the Common middleware.

It also removes the content from any response to a HEAD request and sets the Date and Content-Length response

headers for all requests.

19.4.5 Reverse Proxy Support (X-Forwarded-For Middleware)

Middleware class: django.middleware.http.SetRemoteAddrFromForwardedFor.

This is the example we examined in the “What’s Middleware?” section earlier. It sets

request.META[’REMOTE_ADDR’] based on request.META[’HTTP_X_FORWARDED_FOR’], if the

latter is set. This is useful if you’re sitting behind a reverse proxy that causes each request’s REMOTE_ADDR to be set

to 127.0.0.1.

19.4. Built-in Middleware 267

The Django Book, Release 2.0

Danger!

This middleware does not validate HTTP_X_FORWARDED_FOR.

If you’re not behind a reverse proxy that sets HTTP_X_FORWARDED_FOR automatically, do not use this middle-

ware. Anybody can spoof the value of HTTP_X_FORWARDED_FOR, and because this sets REMOTE_ADDR based on

HTTP_X_FORWARDED_FOR, that means anybody can fake his IP address.

Only use this middleware when you can absolutely trust the value of HTTP_X_FORWARDED_FOR.

19.4.6 Session Support Middleware

Middleware class: django.contrib.sessions.middleware.SessionMiddleware.

This middleware enables session support. See Chapter 14 for details.

19.4.7 Sitewide Cache Middleware

Middleware classes: django.middleware.cache.UpdateCacheMiddleware and

django.middleware.cache.FetchFromCacheMiddleware.

These middlewares work together to cache each Django-powered page. This was discussed in detail in Chapter 15.

19.4.8 Transaction Middleware

Middleware class: django.middleware.transaction.TransactionMiddleware.

This middleware binds a database COMMIT or ROLLBACK to the request/response phase. If a view function runs

successfully, a COMMIT is issued. If the view raises an exception, a ROLLBACK is issued.

The order of this middleware in the stack is important. Middleware modules running outside of it run with commit-

on-save – the default Django behavior. Middleware modules running inside it (coming later in the stack) will be under

the same transaction control as the view functions.

See Appendix B for more about information about database transactions.

19.5 What’s Next?

Web developers and database-schema designers don’t always have the luxury of starting from scratch. In the next

chapter, we’ll cover how to integrate with legacy systems, such as database schemas you’ve inherited from the 1980s.

268 Chapter 19. Chapter 17: Middleware

CHAPTER 20

Chapter 18: Integrating with Legacy Databases and Applications

Django is best suited for so-called green-ﬁeld development – that is, starting projects from scratch, as if you were

constructing a building on a fresh ﬁeld of green grass. But despite the fact that Django favors from-scratch projects,

it’s possible to integrate the framework into legacy databases and applications. This chapter explains a few integration

strategies.

20.1 Integrating with a Legacy Database

Django’s database layer generates SQL schemas from Python code – but with a legacy database, you already have

the SQL schemas. In such a case, you’ll need to create models for your existing database tables. For this purpose,

Django comes with a tool that can generate model code by reading your database table layouts. This tool is called

inspectdb, and you can call it by executing the command manage.py inspectdb.

20.1.1 Using inspectdb

The inspectdb utility introspects the database pointed to by your settings ﬁle, determines a Django model repre-

sentation for each of your tables, and prints the Python model code to standard output.

Here’s a walk-through of a typical legacy database integration process from scratch. The only assumptions are that

Django is installed and that you have a legacy database.

1. Create a Django project by running django-admin.py startproject mysite (where mysite is

your project’s name). We’ll use mysite as the project name in this example.

2. Edit the settings ﬁle in that project, mysite/settings.py, to tell Django what your database connec-

tion parameters are and what the name of the database is. Speciﬁcally, provide the DATABASE_NAME,

DATABASE_ENGINE,DATABASE_USER,DATABASE_PASSWORD,DATABASE_HOST, and

DATABASE_PORT settings. (Note that some of these settings are optional. Refer to Chapter 5 for more

information.)

3. Create a Django application within your project by running python mysite/manage.py startapp

myapp (where myapp is your application’s name). We’ll use myapp as the application name here.

4. Run the command python mysite/manage.py inspectdb. This will examine the tables in the

DATABASE_NAME database and print the generated model class for each table. Take a look at the output to

get an idea of what inspectdb can do.

5. Save the output to the models.py ﬁle within your application by using standard shell output redirection:

python mysite/manage.py inspectdb > mysite/myapp/models.py

269

The Django Book, Release 2.0

6. Edit the mysite/myapp/models.py ﬁle to clean up the generated models and make any necessary cus-

tomizations. We’ll give some hints for this in the next section.

20.1.2 Cleaning Up Generated Models

As you might expect, the database introspection isn’t perfect, and you’ll need to do some light cleanup of the resulting

model code. Here are a few pointers for dealing with the generated models:

1. Each database table is converted to a model class (i.e., there is a one-to-one mapping between database tables

and model classes). This means that you’ll need to refactor the models for any many-to-many join tables into

ManyToManyField objects.

2. Each generated model has an attribute for every ﬁeld, including id primary key ﬁelds. However, recall that

Django automatically adds an id primary key ﬁeld if a model doesn’t have a primary key. Thus, you’ll want to

remove any lines that look like this:

id =models.IntegerField(primary_key=True)

Not only are these lines redundant, but also they can cause problems if your application will be adding new

records to these tables.

3. Each ﬁeld’s type (e.g., CharField,DateField) is determined by looking at the database column type (e.g.,

VARCHAR,DATE). If inspectdb cannot map a column’s type to a model ﬁeld type, it will use TextField

and will insert the Python comment ’This field type is a guess.’ next to the ﬁeld in the generated

model. Keep an eye out for that, and change the ﬁeld type accordingly if needed.

If a ﬁeld in your database has no good Django equivalent, you can safely leave it off. The Django model layer

is not required to include every ﬁeld in your table(s).

4. If a database column name is a Python reserved word (such as pass,class, or for), inspectdb will

append ’_field’ to the attribute name and set the db_column attribute to the real ﬁeld name (e.g., pass,

class, or for).

For example, if a table has an INT column called for, the generated model will have a ﬁeld like this:

for_field =models.IntegerField(db_column=’for’)

inspectdb will insert the Python comment ’Field renamed because it was a Python

reserved word.’ next to the ﬁeld.

5. If your database contains tables that refer to other tables (as most databases do), you might need to rearrange

the order of the generated models so that models that refer to other models are ordered properly. For example, if

model Book has a ForeignKey to model Author, model Author should be deﬁned before model Book.

If you need to create a relationship on a model that has not yet been deﬁned, you can use a string containing the

name of the model, rather than the model object itself.

6. inspectdb detects primary keys for PostgreSQL, MySQL, and SQLite. That is, it inserts

primary_key=True where appropriate. For other databases, you’ll need to insert primary_key=True

for at least one ﬁeld in each model, because Django models are required to have a primary_key=True ﬁeld.

7. Foreign-key detection only works with PostgreSQL and with certain types of MySQL tables. In other cases,

foreign-key ﬁelds will be generated as IntegerField‘‘s, assuming the foreign-key column

was an ‘‘INT column.

270 Chapter 20. Chapter 18: Integrating with Legacy Databases and Applications

The Django Book, Release 2.0

20.2 Integrating with an Authentication System

It’s possible to integrate Django with an existing authentication system – another source of usernames and passwords

or authentication methods.

For example, your company may already have an LDAP setup that stores a username and password for every employee.

It would be a hassle for both the network administrator and the users themselves if users had separate accounts in LDAP

and the Django-based applications.

To handle situations like this, the Django authentication system lets you plug in other authentication sources. You can

override Django’s default database-based scheme, or you can use the default system in tandem with other systems.

20.2.1 Specifying Authentication Backends

Behind the scenes, Django maintains a list of “authentication backends” that it checks for authentication. When

somebody calls django.contrib.auth.authenticate() (as described in Chapter 14), Django tries authen-

ticating across all of its authentication backends. If the ﬁrst authentication method fails, Django tries the second one,

and so on, until all backends have been attempted.

The list of authentication backends to use is speciﬁed in the AUTHENTICATION_BACKENDS setting. This should

be a tuple of Python path names that point to Python classes that know how to authenticate. These classes can be

anywhere on your Python path.

By default, AUTHENTICATION_BACKENDS is set to the following:

(’django.contrib.auth.backends.ModelBackend’,)

That’s the basic authentication scheme that checks the Django users database.

The order of AUTHENTICATION_BACKENDS matters, so if the same username and password are valid in multiple

backends, Django will stop processing at the ﬁrst positive match.

20.2.2 Writing an Authentication Backend

An authentication backend is a class that implements two methods: get_user(id) and

authenticate(**credentials).

The get_user method takes an id – which could be a username, database ID, or whatever – and returns a User

object.

The authenticate method takes credentials as keyword arguments. Most of the time it looks like this:

class MyBackend(object):

def authenticate(self, username=None, password=None):

# Check the username/password and return a User.

But it could also authenticate a token, like so:

class MyBackend(object):

def authenticate(self, token=None):

# Check the token and return a User.

Either way, authenticate should check the credentials it gets, and it should return a User object that matches

those credentials, if the credentials are valid. If they’re not valid, it should return None.

The Django admin system is tightly coupled to Django’s own database-backed User object described in Chapter 14.

The best way to deal with this is to create a Django User object for each user that exists for your backend (e.g., in

20.2. Integrating with an Authentication System 271

The Django Book, Release 2.0

your LDAP directory, your external SQL database, etc.). Either you can write a script to do this in advance or your

authenticate method can do it the ﬁrst time a user logs in.

Here’s an example backend that authenticates against a username and password variable deﬁned in your

settings.py ﬁle and creates a Django User object the ﬁrst time a user authenticates:

from django.conf import settings

from django.contrib.auth.models import User, check_password

class SettingsBackend(object):

"""

Authenticate against the settings ADMIN_LOGIN and ADMIN_PASSWORD.

Use the login name, and a hash of the password. For example:

ADMIN_LOGIN = ’admin’

ADMIN_PASSWORD = ’sha1$4e987$afbcf42e21bd417fb71db8c66b321e9fc33051de’

"""

def authenticate(self, username=None, password=None):

login_valid =(settings.ADMIN_LOGIN == username)

pwd_valid =check_password(password, settings.ADMIN_PASSWORD)

if login_valid and pwd_valid:

try:

user =User.objects.get(username=username)

except User.DoesNotExist:

# Create a new user. Note that we can set password

# to anything, because it won’t be checked; the password

# from settings.py will.

user =User(username=username, password=’get from settings.py’)

user.is_staff =True

user.is_superuser =True

user.save()

return user

return None

def get_user(self, user_id):

try:

return User.objects.get(pk=user_id)

except User.DoesNotExist:

return None

For more on authentication backends, see the ofﬁcial Django documentation.

20.3 Integrating with Legacy Web Applications

It’s possible to run a Django application on the same Web server as an application powered by another technology.

The most straightforward way of doing this is to use Apache’s conﬁguration ﬁle, httpd.conf, to delegate different

URL patterns to different technologies. (Note that Chapter 12 covers Django deployment on Apache/mod_python, so

it might be worth reading that chapter ﬁrst before attempting this integration.)

The key is that Django will be activated for a particular URL pattern only if your httpd.conf ﬁle says so. The

default deployment explained in Chapter 12 assumes you want Django to power every page on a particular domain:

SetHandler python-program

PythonHandler django.core.handlers.modpython

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

272 Chapter 20. Chapter 18: Integrating with Legacy Databases and Applications

The Django Book, Release 2.0

PythonDebug On

</Location>

Here, the <Location "/"> line means “handle every URL, starting at the root,” with Django.

It’s perfectly ﬁne to limit this <Location> directive to a certain directory tree. For example, say you have a legacy

PHP application that powers most pages on a domain and you want to install a Django admin site at /admin/ without

disrupting the PHP code. To do this, just set the <Location> directive to /admin/:

SetHandler python-program

PythonHandler django.core.handlers.modpython

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

PythonDebug On

</Location>

With this in place, only the URLs that start with /admin/ will activate Django. Any other page will use whatever

infrastructure already existed.

Note that attaching Django to a qualiﬁed URL (such as /admin/ in this section’s example) does not affect the Django

URL parsing. Django works with the absolute URL (e.g., /admin/people/person/add/), not a “stripped”

version of the URL (e.g., /people/person/add/). This means that your root URLconf should include the leading

/admin/.

20.4 What’s Next?

If you’re a native English speaker, you might not have noticed one of the coolest features of Django’s admin site: it’s

available in more than 50 different languages! This is made possible by Django’s internationalization framework (and

the hard work of Django’s volunteer translators). The next chapter explains how to use this framework to provide

localized Django sites.

20.4. What’s Next? 273

The Django Book, Release 2.0

274 Chapter 20. Chapter 18: Integrating with Legacy Databases and Applications

CHAPTER 21

Chapter 19: Internationalization

Django was originally developed right in the middle of the United States – quite literally, as Lawrence, Kansas, is less

than 40 miles from the geographic center of the continental United States. Like most open source projects, though,

Django’s community grew to include people from all over the globe. As Django’s community became increasingly

diverse, internationalization and localization became increasingly important. Because many developers have at best a

fuzzy understanding of these terms, we’ll deﬁne them brieﬂy.

Internationalization refers to the process of designing programs for the potential use of any locale. This includes

marking text (such as UI elements and error messages) for future translation, abstracting the display of dates and times

so that different local standards may be observed, providing support for differing time zones, and generally making

sure that the code contains no assumptions about the location of its users. You’ll often see “internationalization”

abbreviated I18N. (The “18” refers to the number of letters omitted between the initial “I” and the terminal “N.”)

Localization refers to the process of actually translating an internationalized program for use in a particular locale.

You’ll sometimes see “localization” abbreviated as L10N.

Django itself is fully internationalized; all strings are marked for translation, and settings control the display of locale-

dependent values like dates and times. Django also ships with more than 50 different localization ﬁles. If you’re not a

native English speaker, there’s a good chance that Django is already translated into your primary language.

The same internationalization framework used for these localizations is available for you to use in your own code and

templates.

To use this framework, you’ll need to add a minimal number of hooks to your Python code and templates. These

hooks are called translation strings. They tell Django, “This text should be translated into the end user’s language, if

a translation for this text is available in that language.”

Django takes care of using these hooks to translate Web applications, on the ﬂy, according to users’ language prefer-

ences.

Essentially, Django does two things:

• It lets developers and template authors specify which parts of their applications should be translatable.

• It uses that information to translate Web applications for particular users according to their language preferences.

Note: Django’s translation machinery uses GNU gettext (http://www.gnu.org/software/gettext/) via the standard

gettext module that comes with Python.

If You Don’t Need Internationalization:

Django’s internationalization hooks are enabled by default, which incurs a small bit of overhead. If you don’t use

internationalization, you should set USE_I18N = False in your settings ﬁle. If USE_I18N is set to False, then

Django will make some optimizations so as not to load the internationalization machinery.

275

The Django Book, Release 2.0

You’ll probably also want to remove ’django.core.context_processors.i18n’ from your

TEMPLATE_CONTEXT_PROCESSORS setting.

The three steps for internationalizing your Django application are:

1. Embed translation strings in your Python code and templates.

2. Get translations for those strings, in whichever languages you want to support.

3. Activate the locale middleware in your Django settings.

We’ll cover each one of these steps in detail.

21.1 1. How to Specify Translation Strings

Translation strings specify “This text should be translated.” These strings can appear in your Python code and tem-

plates. It’s your responsibility to mark translatable strings; the system can only translate strings it knows about.

21.1.1 In Python Code

Standard Translation

Specify a translation string by using the function ugettext(). It’s convention to import this as a shorter alias, _, to

save typing.

In this example, the text "Welcome to my site." is marked as a translation string:

from django.utils.translation import ugettext as _

def my_view(request):

output =_("Welcome to my site.")

return HttpResponse(output)

Obviously, you could code this without using the alias. This example is identical to the previous one:

from django.utils.translation import ugettext

def my_view(request):

output =ugettext("Welcome to my site.")

return HttpResponse(output)

Translation works on computed values. This example is identical to the previous two:

def my_view(request):

words =[’Welcome’,’to’,’my’,’site.’]

output =_(’ ’.join(words))

return HttpResponse(output)

Translation works on variables. Again, here’s an identical example:

def my_view(request):

sentence =’Welcome to my site.’

output =_(sentence)

return HttpResponse(output)

276 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

(The caveat with using variables or computed values, as in the previous two examples, is that Django’s translation-

string-detecting utility, django-admin.py makemessages, won’t be able to ﬁnd these strings. More on

makemessages later.)

The strings you pass to _() or ugettext() can take placeholders, speciﬁed with Python’s standard named-string

interpolation syntax. Example:

def my_view(request, m, d):

output =_(’Today is %(month)s %(day)s.’)%{’month’:m,’day’: d}

return HttpResponse(output)

This technique lets language-speciﬁc translations reorder the placeholder text. For example, an English translation

may be "Today is November 26.", while a Spanish translation may be "Hoy es 26 de Noviembre."

– with the placeholders (the month and the day) with their positions swapped.

For this reason, you should use named-string interpolation (e.g., %(day)s) instead of positional interpolation (e.g.,

%s or %d) whenever you have more than a single parameter. If you used positional interpolation, translations wouldn’t

be able to reorder placeholder text.

Marking Strings as No-Op

Use the function django.utils.translation.ugettext_noop() to mark a string as a translation string

without translating it. The string is later translated from a variable.

Use this if you have constant strings that should be stored in the source language because they are exchanged over

systems or users – such as strings in a database – but should be translated at the last possible point in time, such as

when the string is presented to the user.

Lazy Translation

Use the function django.utils.translation.ugettext_lazy() to translate strings lazily – when the

value is accessed rather than when the ugettext_lazy() function is called.

For example, to translate a model’s help_text, do the following:

from django.utils.translation import ugettext_lazy

class MyThing(models.Model):

name =models.CharField(help_text=ugettext_lazy(’This is the help text’))

In this example, ugettext_lazy() stores a lazy reference to the string – not the actual translation. The translation

itself will be done when the string is used in a string context, such as template rendering on the Django admin site.

The result of a ugettext_lazy() call can be used wherever you would use a unicode string (an object with type

unicode) in Python. If you try to use it where a bytestring (a str object) is expected, things will not work as

expected, since a ugettext_lazy() object doesn’t know how to convert itself to a bytestring. You can’t use a

unicode string inside a bytestring, either, so this is consistent with normal Python behavior. For example:

# This is fine: putting a unicode proxy into a unicode string.

u"Hello %s"%ugettext_lazy("people")

# This will not work, since you cannot insert a unicode object

# into a bytestring (nor can you insert our unicode proxy there)

"Hello %s"%ugettext_lazy("people")

If you ever see output that looks like "hello <django.utils.functional...>", you have tried to insert

the result of ugettext_lazy() into a bytestring. That’s a bug in your code.

21.1. 1. How to Specify Translation Strings 277

The Django Book, Release 2.0

If you don’t like the verbose name ugettext_lazy, you can just alias it as _(underscore), like so:

from django.utils.translation import ugettext_lazy as _

class MyThing(models.Model):

name =models.CharField(help_text=_(’This is the help text’))

Always use lazy translations in Django models. Field names and table names should be marked for translation

(otherwise, they won’t be translated in the admin interface). This means writing explicit verbose_name and

verbose_name_plural options in the Meta class, though, rather than relying on Django’s default determina-

tion of verbose_name and verbose_name_plural by looking at the model’s class name:

from django.utils.translation import ugettext_lazy as _

class MyThing(models.Model):

name =models.CharField(_(’name’), help_text=_(’This is the help text’))

class Meta:

verbose_name =_(’my thing’)

verbose_name_plural =_(’mythings’)

Pluralization

Use the function django.utils.translation.ungettext() to specify pluralized messages. Example:

from django.utils.translation import ungettext

def hello_world(request, count):

page =ungettext(’there is %(count)d object’,

’there are %(count)d objects’, count) %{

’count’: count,

}

return HttpResponse(page)

ungettext takes three arguments: the singular translation string, the plural translation string and the number of

objects (which is passed to the translation languages as the count variable).

21.1.2 In Template Code

Translation in Django templates uses two template tags and a slightly different syntax than in Python code. To give

your template access to these tags, put {% load i18n %} toward the top of your template.

The {% trans %} template tag translates either a constant string (enclosed in single or double quotes) or variable

content:

<title>{% trans "This is the title." %}</title>

<title>{% trans myvar %}</title>

If the noop option is present, variable lookup still takes place but the translation is skipped. This is useful when

“stubbing out” content that will require translation in the future:

<title>{% trans "myvar" noop %}</title>

It’s not possible to mix a template variable inside a string within {% trans %}. If your translations require strings

with variables (placeholders), use {% blocktrans %}:

{% blocktrans %}This string will have {{ value }} inside.{% endblocktrans %}

278 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

To translate a template expression – say, using template ﬁlters – you need to bind the expression to a local variable for

use within the translation block:

{% blocktrans with value|filter as myvar %}

This will have {{ myvar }} inside.

{% endblocktrans %}

If you need to bind more than one expression inside a blocktrans tag, separate the pieces with and:

{% blocktrans with book|title as book_t and author|title as author_t %}

This is {{ book_t }} by {{ author_t }}

{% endblocktrans %}

To pluralize, specify both the singular and plural forms with the {% plural %} tag, which appears within {%

blocktrans %} and {% endblocktrans %}. Example:

{% blocktrans count list|length as counter %}

There is only one {{ name }} object.

{% plural %}

There are {{ counter }} {{ name }} objects.

{% endblocktrans %}

Internally, all block and inline translations use the appropriate ugettext /ungettext call.

Each RequestContext has access to three translation-speciﬁc variables:

•LANGUAGES is a list of tuples in which the ﬁrst element is the language code and the second is the language

name (translated into the currently active locale).

•LANGUAGE_CODE is the current user’s preferred language, as a string. Example: en-us. (See “How Django

discovers language preference,” below.)

•LANGUAGE_BIDI is the current locale’s direction. If True, it’s a right-to-left language, e.g.: Hebrew, Arabic.

If False it’s a left-to-right language, e.g.: English, French, German etc.

If you don’t use the RequestContext extension, you can get those values with three tags:

{% get_current_language as LANGUAGE_CODE %}

{% get_available_languages as LANGUAGES %}

{% get_current_language_bidi as LANGUAGE_BIDI %}

These tags also require a {% load i18n %}.

Translation hooks are also available within any template block tag that accepts constant strings. In those cases, just

use _() syntax to specify a translation string:

{% some_special_tag _("Page not found") value|yesno:_("yes,no") %}

In this case, both the tag and the ﬁlter will see the already-translated string, so they don’t need to be aware of transla-

tions.

Note: In this example, the translation infrastructure will be passed the string "yes,no", not the individual strings

"yes" and "no". The translated string will need to contain the comma so that the ﬁlter parsing code knows how

to split up the arguments. For example, a German translator might translate the string "yes,no" as "ja,nein"

(keeping the comma intact).

21.1. 1. How to Specify Translation Strings 279

The Django Book, Release 2.0

21.1.3 Working With Lazy Translation Objects

Using ugettext_lazy() and ungettext_lazy() to mark strings in models and utility functions is a common

operation. When you’re working with these objects elsewhere in your code, you should ensure that you don’t acci-

dentally convert them to strings, because they should be converted as late as possible (so that the correct locale is in

effect). This necessitates the use of a couple of helper functions.

Joining Strings: string_concat()

Standard Python string joins (’’.join([...])) will not work on lists containing lazy translation objects. Instead,

you can use django.utils.translation.string_concat(), which creates a lazy object that concatenates

its contents and converts them to strings only when the result is included in a string. For example:

from django.utils.translation import string_concat

# ...

name =ugettext_lazy(u’John Lennon’)

instrument =ugettext_lazy(u’guitar’)

result =string_concat([name, ’: ’, instrument])

In this case, the lazy translations in result will only be converted to strings when result itself is used in a string

(usually at template rendering time).

The allow_lazy() Decorator

Django offers many utility functions (particularly in django.utils) that take a string as their ﬁrst argument and

do something to that string. These functions are used by template ﬁlters as well as directly in other code.

If you write your own similar functions and deal with translations, you’ll face the problem of what to do when the ﬁrst

argument is a lazy translation object. You don’t want to convert it to a string immediately, because you might be using

this function outside of a view (and hence the current thread’s locale setting will not be correct).

For cases like this, use the django.utils.functional.allow_lazy() decorator. It modiﬁes the function

so that if it’s called with a lazy translation as the ﬁrst argument, the function evaluation is delayed until it needs to be

converted to a string.

For example:

from django.utils.functional import allow_lazy

def fancy_utility_function(s, ...):

# Do some conversion on string ’s’

# ...

fancy_utility_function = allow_lazy(fancy_utility_function, unicode)

The allow_lazy() decorator takes, in addition to the function to decorate, a number of extra arguments (*args)

specifying the type(s) that the original function can return. Usually, it’s enough to include unicode here and ensure

that your function returns only Unicode strings.

Using this decorator means you can write your function and assume that the input is a proper string, then add support

for lazy translation objects at the end.

21.2 2. How to Create Language Files

Once you’ve tagged your strings for later translation, you need to write (or obtain) the language translations them-

selves. Here’s how that works.

280 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

Locale restrictions

Django does not support localizing your application into a locale for which Django itself has not been translated. In

this case, it will ignore your translation ﬁles. If you were to try this and Django supported it, you would inevitably see

a mixture of translated strings (from your application) and English strings (from Django itself). If you want to support

a locale for your application that is not already part of Django, you’ll need to make at least a minimal translation of

the Django core.

21.2.1 Message Files

The ﬁrst step is to create a message ﬁle for a new language. A message ﬁle is a plain-text ﬁle, representing a single

language, that contains all available translation strings and how they should be represented in the given language.

Message ﬁles have a .po ﬁle extension.

Django comes with a tool, django-admin.py makemessages, that automates the creation and upkeep of these

ﬁles. To create or update a message ﬁle, run this command:

django-admin.py makemessages -l de

...where de is the language code for the message ﬁle you want to create. The language code, in this case, is in locale

format. For example, it’s pt_BR for Brazilian Portuguese and de_AT for Austrian German.

The script should be run from one of three places:

• The root directory of your Django project.

• The root directory of your Django app.

• The root django directory (not a Subversion checkout, but the one that is linked-to via $PYTHONPATH or is

located somewhere on that path). This is only relevant when you are creating a translation for Django itself.

This script runs over your project source tree or your application source tree and pulls out all strings marked for trans-

lation. It creates (or updates) a message ﬁle in the directory locale/LANG/LC_MESSAGES. In the de example,

the ﬁle will be locale/de/LC_MESSAGES/django.po.

By default django-admin.py makemessages examines every ﬁle that has the .html ﬁle extension. In case

you want to override that default, use the --extension or -e option to specify the ﬁle extensions to examine:

django-admin.py makemessages -l de -e txt

Separate multiple extensions with commas and/or use -e or --extension multiple times:

django-admin.py makemessages -l de -e html,txt -e xml

When creating JavaScript translation catalogs (which we’ll cover later in this chapter,) you need to use the special

‘djangojs’ domain, not -e js.

No gettext?

If you don’t have the gettext utilities installed, django-admin.py makemessages will create empty

ﬁles. If that’s the case, either install the gettext utilities or just copy the English message ﬁle

(locale/en/LC_MESSAGES/django.po) if available and use it as a starting point; it’s just an empty translation

ﬁle.

Working on Windows?

If you’re using Windows and need to install the GNU gettext utilities so django-admin makemessages works,

see the “gettext on Windows” section below for more information.

21.2. 2. How to Create Language Files 281

The Django Book, Release 2.0

The format of .po ﬁles is straightforward. Each .po ﬁle contains a small bit of metadata, such as the translation

maintainer’s contact information, but the bulk of the ﬁle is a list of messages – simple mappings between translation

strings and the actual translated text for the particular language.

For example, if your Django app contained a translation string for the text "Welcome to my site.", like so:

_("Welcome to my site.")

...then django-admin.py makemessages will have created a .po ﬁle containing the following snippet – a

message:

#: path/to/python/module.py:23

msgid "Welcome to my site."

msgstr ""

A quick explanation:

•msgid is the translation string, which appears in the source. Don’t change it.

•msgstr is where you put the language-speciﬁc translation. It starts out empty, so it’s your responsibility to

change it. Make sure you keep the quotes around your translation.

• As a convenience, each message includes, in the form of a comment line preﬁxed with #and located above the

msgid line, the ﬁlename and line number from which the translation string was gleaned.

Long messages are a special case. There, the ﬁrst string directly after the msgstr (or msgid) is an empty string. Then

the content itself will be written over the next few lines as one string per line. Those strings are directly concatenated.

Don’t forget trailing spaces within the strings; otherwise, they’ll be tacked together without whitespace!

To reexamine all source code and templates for new translation strings and update all message ﬁles for all languages,

run this:

django-admin.py makemessages -a

21.2.2 Compiling Message Files

After you create your message ﬁle – and each time you make changes to it – you’ll need to compile it into a more

efﬁcient form, for use by gettext. Do this with the django-admin.py compilemessages utility.

This tool runs over all available .po ﬁles and creates .mo ﬁles, which are binary ﬁles optimized for use by gettext.

In the same directory from which you ran django-admin.py makemessages, run django-admin.py

compilemessages like this:

django-admin.py compilemessages

That’s it. Your translations are ready for use.

21.3 3. How Django Discovers Language Preference

Once you’ve prepared your translations – or, if you just want to use the translations that come with Django – you’ll

just need to activate translation for your app.

Behind the scenes, Django has a very ﬂexible model of deciding which language should be used – installation-wide,

for a particular user, or both.

To set an installation-wide language preference, set LANGUAGE_CODE. Django uses this language as the default

translation – the ﬁnal attempt if no other translator ﬁnds a translation.

282 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

If all you want to do is run Django with your native language, and a language ﬁle is available for your language, all

you need to do is set LANGUAGE_CODE.

If you want to let each individual user specify which language he or she prefers, use LocaleMiddleware.

LocaleMiddleware enables language selection based on data from the request. It customizes content for each

user.

To use LocaleMiddleware, add ’django.middleware.locale.LocaleMiddleware’ to your

MIDDLEWARE_CLASSES setting. Because middleware order matters, you should follow these guidelines:

• Make sure it’s one of the ﬁrst middlewares installed.

• It should come after SessionMiddleware, because LocaleMiddleware makes use of session data.

• If you use CacheMiddleware, put LocaleMiddleware after it.

For example, your MIDDLEWARE_CLASSES might look like this:

MIDDLEWARE_CLASSES =(

’django.contrib.sessions.middleware.SessionMiddleware’,

’django.middleware.locale.LocaleMiddleware’,

’django.middleware.common.CommonMiddleware’,

)

(For more on middleware, see Chapter 17.)

LocaleMiddleware tries to determine the user’s language preference by following this algorithm:

• First, it looks for a django_language key in the current user’s session.

• Failing that, it looks for a cookie.

• Failing that, it looks at the Accept-Language HTTP header. This header is sent by your browser and tells

the server which language(s) you prefer, in order by priority. Django tries each language in the header until it

ﬁnds one with available translations.

• Failing that, it uses the global LANGUAGE_CODE setting.

Notes:

• In each of these places, the language preference is expected to be in the standard language format, as a string.

For example, Brazilian Portuguese is pt-br.

• If a base language is available but the sublanguage speciﬁed is not, Django uses the base language. For example,

if a user speciﬁes de-at (Austrian German) but Django only has de available, Django uses de.

• Only languages listed in the LANGUAGES setting can be selected. If you want to restrict the language selection to

a subset of provided languages (because your application doesn’t provide all those languages), set LANGUAGES

to a list of languages. For example:

LANGUAGES =(

(’de’, _(’German’)),

(’en’, _(’English’)),

)

This example restricts languages that are available for automatic selection to German and English (and any

sublanguage, like de-ch or en-us).

• If you deﬁne a custom LANGUAGES setting, as explained in the previous bullet, it’s OK to mark

the languages as translation strings – but use a “dummy” ugettext() function, not the one in

django.utils.translation. You should never import django.utils.translation from within

your settings ﬁle, because that module in itself depends on the settings, and that would cause a circular import.

The solution is to use a “dummy” ugettext() function. Here’s a sample settings ﬁle:

21.3. 3. How Django Discovers Language Preference 283

The Django Book, Release 2.0

ugettext =lambda s: s

LANGUAGES =(

(’de’, ugettext(’German’)),

(’en’, ugettext(’English’)),

)

With this arrangement, django-admin.py makemessages will still ﬁnd and mark these strings for trans-

lation, but the translation won’t happen at runtime – so you’ll have to remember to wrap the languages in the

real ugettext() in any code that uses LANGUAGES at runtime.

• The LocaleMiddleware can only select languages for which there is a Django-provided base translation.

If you want to provide translations for your application that aren’t already in the set of translations in Django’s

source tree, you’ll want to provide at least basic translations for that language. For example, Django uses

technical message IDs to translate date formats and time formats – so you will need at least those translations

for the system to work correctly.

A good starting point is to copy the English .po ﬁle and to translate at least the technical messages – maybe the

validation messages, too.

Technical message IDs are easily recognized; they’re all upper case. You don’t translate the message ID as

with other messages, you provide the correct local variant on the provided English value. For example, with

DATETIME_FORMAT (or DATE_FORMAT or TIME_FORMAT), this would be the format string that you want

to use in your language. The format is identical to the format strings used by the now template tag.

Once LocaleMiddleware determines the user’s preference, it makes this preference available as

request.LANGUAGE_CODE for each HttpRequest. Feel free to read this value in your view code. Here’s a

simple example:

def hello_world(request):

if request.LANGUAGE_CODE == ’de-at’:

return HttpResponse("You prefer to read Austrian German.")

else:

return HttpResponse("You prefer to read another language.")

Note that, with static (middleware-less) translation, the language is in settings.LANGUAGE_CODE, while with

dynamic (middleware) translation, it’s in request.LANGUAGE_CODE.

21.4 Using Translations in Your Own Projects

Django looks for translations by following this algorithm:

• First, it looks for a locale directory in the application directory of the view that’s being called. If it ﬁnds a

translation for the selected language, the translation will be installed.

• Next, it looks for a locale directory in the project directory. If it ﬁnds a translation, the translation will be

installed.

• Finally, it checks the Django-provided base translation in django/conf/locale.

This way, you can write applications that include their own translations, and you can override base translations in

your project path. Or, you can just build a big project out of several apps and put all translations into one big project

message ﬁle. The choice is yours.

All message ﬁle repositories are structured the same way. They are:

•$APPPATH/locale/<language>/LC_MESSAGES/django.(po|mo)

•$PROJECTPATH/locale/<language>/LC_MESSAGES/django.(po|mo)

284 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

• All paths listed in LOCALE_PATHS in your settings ﬁle are searched in that order for

<language>/LC_MESSAGES/django.(po|mo)

•$PYTHONPATH/django/conf/locale/<language>/LC_MESSAGES/django.(po|mo)

To create message ﬁles, you use the same django-admin.py makemessages tool as with the Django message

ﬁles. You only need to be in the right place – in the directory where either the conf/locale (in case of the source

tree) or the locale/ (in case of app messages or project messages) directory are located. And you use the same

django-admin.py compilemessages to produce the binary django.mo ﬁles that are used by gettext.

You can also run django-admin.py compilemessages --settings=path.to.settings to make the

compiler process all the directories in your LOCALE_PATHS setting.

Application message ﬁles are a bit complicated to discover – they need the LocaleMiddleware. If you don’t use

the middleware, only the Django message ﬁles and project message ﬁles will be processed.

Finally, you should give some thought to the structure of your translation ﬁles. If your applications need to be delivered

to other users and will be used in other projects, you might want to use app-speciﬁc translations. But using app-

speciﬁc translations and project translations could produce weird problems with makemessages:makemessages

will traverse all directories below the current path and so might put message IDs into the project message ﬁle that are

already in application message ﬁles.

The easiest way out is to store applications that are not part of the project (and so carry their own translations) outside

the project tree. That way, django-admin.py makemessages on the project level will only translate strings

that are connected to your explicit project and not strings that are distributed independently.

21.5 The set_language Redirect View

As a convenience, Django comes with a view, django.views.i18n.set_language, that sets a user’s language

preference and redirects back to the previous page.

Activate this view by adding the following line to your URLconf:

(r’^i18n/’, include(’django.conf.urls.i18n’)),

(Note that this example makes the view available at /i18n/setlang/.)

The view expects to be called via the POST method, with a language parameter set in request. If session support is

enabled, the view saves the language choice in the user’s session. Otherwise, it saves the language choice in a cookie

that is by default named django_language. (The name can be changed through the LANGUAGE_COOKIE_NAME

setting.)

After setting the language choice, Django redirects the user, following this algorithm:

• Django looks for a next parameter in the POST data.

• If that doesn’t exist, or is empty, Django tries the URL in the Referrer header.

• If that’s empty – say, if a user’s browser suppresses that header – then the user will be redirected to /(the site

root) as a fallback.

Here’s example HTML template code:

{% for lang in LANGUAGES %}

{% endfor %}

</select>

21.5. The set_language Redirect View 285

The Django Book, Release 2.0

</form>

21.6 Translations and JavaScript

Adding translations to JavaScript poses some problems:

• JavaScript code doesn’t have access to a gettext implementation.

• JavaScript code doesn’t have access to .po or .mo ﬁles; they need to be delivered by the server.

• The translation catalogs for JavaScript should be kept as small as possible.

Django provides an integrated solution for these problems: It passes the translations into JavaScript, so you can call

gettext, etc., from within JavaScript.

21.6.1 The javascript_catalog View

The main solution to these problems is the javascript_catalog view, which sends out a JavaScript code library

with functions that mimic the gettext interface, plus an array of translation strings. Those translation strings are

taken from the application, project or Django core, according to what you specify in either the info_dict or the URL.

You hook it up like this:

js_info_dict ={

’packages’: (’your.app.package’,),

}

urlpatterns =patterns(’’,

(r’^jsi18n/$’,’django.views.i18n.javascript_catalog’, js_info_dict),

)

Each string in packages should be in Python dotted-package syntax (the same format as the strings in

INSTALLED_APPS) and should refer to a package that contains a locale directory. If you specify multiple pack-

ages, all those catalogs are merged into one catalog. This is useful if you have JavaScript that uses strings from

different applications.

You can make the view dynamic by putting the packages into the URL pattern:

urlpatterns =patterns(’’,

(r’^jsi18n/(?P<packages>\S+)/$’,’django.views.i18n.javascript_catalog’),

)

With this, you specify the packages as a list of package names delimited by ‘+’ signs in the URL. This is especially use-

ful if your pages use code from different apps and this changes often and you don’t want to pull in one big catalog ﬁle.

As a security measure, these values can only be either django.conf or any package from the INSTALLED_APPS

setting.

21.6.2 Using the JavaScript Translation Catalog

To use the catalog, just pull in the dynamically generated script like this:

286 Chapter 21. Chapter 19: Internationalization

The Django Book, Release 2.0

This is how the admin fetches the translation catalog from the server. When the catalog is loaded, your JavaScript

code can use the standard gettext interface to access it:

document.write(gettext(’this is to be translated’));

There is also an ngettext interface:

var object_cnt = 1 // or 0, or 2, or 3, ...

s = ngettext(’literal for the singular case’,

’literal for the plural case’, object_cnt);

and even a string interpolation function:

function interpolate(fmt, obj, named);

The interpolation syntax is borrowed from Python, so the interpolate function supports both positional and named

interpolation:

• Positional interpolation: obj contains a JavaScript Array object whose elements values are then sequentially

interpolated in their corresponding fmt placeholders in the same order they appear. For example:

fmts = ngettext(’There is %s object. Remaining: %s’,

’There are %s objects. Remaining: %s’, 11);

s = interpolate(fmts, [11, 20]);

// s is ’There are 11 objects. Remaining: 20’

• Named interpolation: This mode is selected by passing the optional boolean named parameter as true. obj

contains a JavaScript object or associative array. For example:

d={

count: 10

total: 50

};

fmts = ngettext(’Total: %(total)s, there is %(count)s object’,

’there are %(count)s of a total of %(total)s objects’, d.count);

s = interpolate(fmts, d, true);

You shouldn’t go over the top with string interpolation, though: this is still JavaScript, so the code has to make repeated

regular-expression substitutions. This isn’t as fast as string interpolation in Python, so keep it to those cases where you

really need it (for example, in conjunction with ngettext to produce proper pluralizations).

21.6.3 Creating JavaScript Translation Catalogs

You create and update the translation catalogs the same way as the other

Django translation catalogs – with the django-admin.py makemessages tool. The only difference is you need to provide

a-d djangojs parameter, like this:

django-admin.py makemessages -d djangojs -l de

This would create or update the translation catalog for JavaScript for German. After updating translation catalogs, just

run django-admin.py compilemessages the same way as you do with normal Django translation catalogs.

21.7 Notes for Users Familiar with gettext

If you know gettext, you might note these specialties in the way Django does translation:

21.7. Notes for Users Familiar with gettext 287

The Django Book, Release 2.0

• The string domain is django or djangojs. This string domain is used to differentiate between different pro-

grams that store their data in a common message-ﬁle library (usually /usr/share/locale/). The django

domain is used for python and template translation strings and is loaded into the global translation catalogs.

The djangojs domain is only used for JavaScript translation catalogs to make sure that those are as small as

possible.

• Django doesn’t use xgettext alone. It uses Python wrappers around xgettext and msgfmt. This is mostly

for convenience.

21.8 gettext on Windows

This is only needed for people who either want to extract message IDs or compile message ﬁles (.po). Translation

work itself just involves editing existing ﬁles of this type, but if you want to create your own message ﬁles, or want to

test or compile a changed message ﬁle, you will need the gettext utilities:

• Download the following zip ﬁles from http://sourceforge.net/projects/gettext

–gettext-runtime-X.bin.woe32.zip

–gettext-tools-X.bin.woe32.zip

–libiconv-X.bin.woe32.zip

• Extract the 3 ﬁles in the same folder (i.e. C:\Program Files\gettext-utils)

• Update the system PATH:

–Control Panel > System > Advanced > Environment Variables

–In the System variables list, click Path, click Edit

–Add ;C:\Program Files\gettext-utils\bin at the end of the Variable value ﬁeld

You may also use gettext binaries you have obtained elsewhere, so long as the xgettext --version com-

mand works properly. Some version 0.14.4 binaries have been found to not support this command. Do not attempt

to use Django translation utilities with a gettext package if the command xgettext --version entered at a

Windows command prompt causes a popup window saying “xgettext.exe has generated errors and will be closed by

Windows”.

21.9 What’s Next?

The ﬁnal chapter focuses on security – how you can help secure your sites and your users from malicious attackers.

288 Chapter 21. Chapter 19: Internationalization

CHAPTER 22

Chapter 20: Security

The Internet can be a scary place.

These days, high-proﬁle security gaffes seem to crop up on a daily basis. We’ve seen viruses spread with amazing

speed, swarms of compromised computers wielded as weapons, a never-ending arms race against spammers, and

many, many reports of identify theft from hacked Web sites.

As Web developers, we have a duty to do what we can to combat these forces of darkness. Every Web developer needs

to treat security as a fundamental aspect of Web programming. Unfortunately, it turns out that implementing security

is hard – attackers need to ﬁnd only a single vulnerability, but defenders have to protect every single one.

Django attempts to mitigate this difﬁculty. It’s designed to automatically protect you from many of the common

security mistakes that new (and even experienced) Web developers make. Still, it’s important to understand what these

problems are, how Django protects you, and – most important – the steps you can take to make your code even more

secure.

First, though, an important disclaimer: We do not intend to present a deﬁnitive guide to every known Web security

exploit, and so we won’t try to explain each vulnerability in a comprehensive manner. Instead, we’ll give a short

synopsis of security problems as they apply to Django.

22.1 The Theme of Web Security

If you learn only one thing from this chapter, let it be this:

Never – under any circumstances – trust data from the browser.

You never know who’s on the other side of that HTTP connection. It might be one of your users, but it just as easily

could be a nefarious cracker looking for an opening.

Any data of any nature that comes from the browser needs to be treated with a healthy dose of paranoia. This includes

data that’s both “in band” (i.e., submitted from Web forms) and “out of band” (i.e., HTTP headers, cookies, and other

request information). It’s trivial to spoof the request metadata that browsers usually add automatically.

Every one of the vulnerabilities discussed in this chapter stems directly from trusting data that comes over the wire

and then failing to sanitize that data before using it. You should make it a general practice to continuously ask, “Where

does this data come from?”

22.2 SQL Injection

SQL injection is a common exploit in which an attacker alters Web page parameters (such as GET/POST data or URLs)

to insert arbitrary SQL snippets that a naive Web application executes in its database directly. It’s probably the most

289

The Django Book, Release 2.0

dangerous – and, unfortunately, one of the most common – vulnerabilities out there.

This vulnerability most commonly crops up when constructing SQL “by hand” from user input. For example, imagine

writing a function to gather a list of contact information from a contact search page. To prevent spammers from

reading every single e-mail in our system, we’ll force the user to type in someone’s username before providing her

e-mail address:

def user_contacts(request):

user =request.GET[’username’]

sql ="SELECT *FROM user_contacts WHERE username = ’%s’;" %username

# execute the SQL here...

Note: In this example, and all similar “don’t do this” examples that follow, we’ve deliberately left out most of the

code needed to make the functions actually work. We don’t want this code to work if someone accidentally takes it

out of context.

Though at ﬁrst this doesn’t look dangerous, it really is.

First, our attempt at protecting our entire e-mail list will fail with a cleverly constructed query. Think about what

happens if an attacker types "’ OR ’a’=’a" into the query box. In that case, the query that the string interpolation

will construct will be:

SELECT *FROM user_contacts WHERE username = ’’ OR ’a’ = ’a’;

Because we allowed unsecured SQL into the string, the attacker’s added OR clause ensures that every single row is

returned.

However, that’s the least scary attack. Imagine what will happen if the attacker submits "’; DELETE FROM

user_contacts WHERE ’a’ = ’a". We’ll end up with this complete query:

SELECT *FROM user_contacts WHERE username = ’’; DELETE FROM user_contacts WHERE ’a’ = ’a’;

Yikes! Our entire contact list would be deleted instantly.

22.2.1 The Solution

Although this problem is insidious and sometimes hard to spot, the solution is simple: never trust user-submitted data,

and always escape it when passing it into SQL.

The Django database API does this for you. It automatically escapes all special SQL parameters, according to the

quoting conventions of the database server you’re using (e.g., PostgreSQL or MySQL).

For example, in this API call:

foo.get_list(bar__exact="’ OR 1=1")

Django will escape the input accordingly, resulting in a statement like this:

SELECT *FROM foos WHERE bar = ’\’ OR 1=1’

Completely harmless.

This applies to the entire Django database API, with a couple of exceptions:

• The where argument to the extra() method. (See Appendix C.) That parameter accepts raw SQL by design.

• Queries done “by hand” using the lower-level database API. (See Chapter 10.)

In each of these cases, it’s easy to keep yourself protected. In each case, avoid string interpolation in favor of passing

in bind parameters. That is, the example we started this section with should be written as follows:

290 Chapter 22. Chapter 20: Security

The Django Book, Release 2.0

from django.db import connection

def user_contacts(request):

user =request.GET[’username’]

sql ="SELECT *FROM user_contacts WHERE username = %s"

cursor =connection.cursor()

cursor.execute(sql, [user])

# ... do something with the results

The low-level execute method takes a SQL string with %s placeholders and automatically escapes and inserts

parameters from the list passed as the second argument. You should always construct custom SQL this way.

Unfortunately, you can’t use bind parameters everywhere in SQL; they’re not allowed as identiﬁers (i.e., table or

column names). Thus, if you need to, say, dynamically construct a list of tables from a POST variable, you’ll need

to escape that name in your code. Django provides a function, django.db.connection.ops.quote_name,

which will escape the identiﬁer according to the current database’s quoting scheme.

22.3 Cross-Site Scripting (XSS)

Cross-site scripting (XSS), is found in Web applications that fail to escape user-submitted content properly before

rendering it into HTML. This allows an attacker to insert arbitrary HTML into your Web page, usually in the form of

<script> tags.

Attackers often use XSS attacks to steal cookie and session information, or to trick users into giving private information

to the wrong person (aka phishing).

This type of attack can take a number of different forms and has almost inﬁnite permutations, so we’ll just look at a

typical example. Consider this extremely simple “Hello, World” view:

from django.http import HttpResponse

def say_hello(request):

name =request.GET.get(’name’,’world’)

return HttpResponse(’<h1>Hello, %s!</h1>’ %name)

This view simply reads a name from a GET parameter and passes that name into the generated HTML. So, if we

accessed http://example.com/hello/?name=Jacob, the page would contain this:

<h1>Hello, Jacob!</h1>

But wait – what happens if we access http://example.com/hello/?name=<i>Jacob</i>? Then we get

this:

<h1>Hello, <i>Jacob</i>!</h1>

Of course, an attacker wouldn’t use something as benign as <i> tags; he could include a whole set of HTML that

hijacked your page with arbitrary content. This type of attack has been used to trick users into entering data into what

looks like their bank’s Web site, but in fact is an XSS-hijacked form that submits their back account information to an

attacker.

The problem gets worse if you store this data in the database and later display it it on your site. For example, MySpace

was once found to be vulnerable to an XSS attack of this nature. A user inserted JavaScript into his proﬁle that

automatically added him as your friend when you visited his proﬁle page. Within a few days, he had millions of

friends.

Now, this may sound relatively benign, but keep in mind that this attacker managed to get his code – not MySpace’s

– running on your computer. This violates the assumed trust that all the code on MySpace is actually written by

MySpace.

22.3. Cross-Site Scripting (XSS) 291

The Django Book, Release 2.0

MySpace was extremely lucky that this malicious code didn’t automatically delete viewers’ accounts, change their

passwords, ﬂood the site with spam, or any of the other nightmare scenarios this vulnerability unleashes.

22.3.1 The Solution

The solution is simple: always escape any content that might have come from a user before inserting it into HTML.

To guard against this, Django’s template system automatically escapes all variable values. Let’s see what happens if

we rewrite our example using the template system:

# views.py

from django.shortcuts import render

def say_hello(request):

name = request.GET.get(’name’, ’world’)

return render(request, ’hello.html’, {’name’: name})

# hello.html

<h1>Hello, {{ name }}!</h1>

With this in place, a request to http://example.com/hello/name=<i>Jacob</i> will result in the fol-

lowing page:

<h1>Hello, <i>Jacob</i>!</h1>

We covered Django’s auto-escaping back in Chapter 4, along with ways to turn it off. But even if you’re using this

feature, you should still get in the habit of asking yourself, at all times, “Where does this data come from?” No

automatic solution will ever protect your site from XSS attacks 100% of the time.

22.4 Cross-Site Request Forgery

Cross-site request forgery (CSRF) happens when a malicious Web site tricks users into unknowingly loading a URL

from a site at which they’re already authenticated – hence taking advantage of their authenticated status.

Django has built-in tools to protect from this kind of attack. Both the attack itself and those tools are covered in great

detail in Chapter 16.

22.5 Session Forging/Hijacking

This isn’t a speciﬁc attack, but rather a general class of attacks on a user’s session data. It can take a number of

different forms:

• A man-in-the-middle attack, where an attacker snoops on session data as it travels over the wire (or wireless)

network.

•Session forging, where an attacker uses a session ID (perhaps obtained through a man-in-the-middle attack) to

pretend to be another user.

An example of these ﬁrst two would be an attacker in a coffee shop using the shop’s wireless network to capture

a session cookie. She could then use that cookie to impersonate the original user.

292 Chapter 22. Chapter 20: Security

The Django Book, Release 2.0

• A cookie-forging attack, where an attacker overrides the supposedly read-only data stored in a cookie. Chapter

14 explains in detail how cookies work, and one of the salient points is that it’s trivial for browsers and malicious

users to change cookies without your knowledge.

There’s a long history of Web sites that have stored a cookie like IsLoggedIn=1 or even

LoggedInAsUser=jacob. It’s dead simple to exploit these types of cookies.

On a more subtle level, though, it’s never a good idea to trust anything stored in cookies. You never know who’s

been poking at them.

•Session ﬁxation, where an attacker tricks a user into setting or reseting the user’s session ID.

For example, PHP allows session identiﬁers to be passed in the URL (e.g.,

http://example.com/?PHPSESSID=fa90197ca25f6ab40bb1374c510d7a32). An attacker

who tricks a user into clicking a link with a hard-coded session ID will cause the user to pick up that session.

Session ﬁxation has been used in phishing attacks to trick users into entering personal information into an

account the attacker owns. He can later log into that account and retrieve the data.

•Session poisoning, where an attacker injects potentially dangerous data into a user’s session – usually through a

Web form that the user submits to set session data.

A canonical example is a site that stores a simple user preference (like a page’s background color) in a cookie.

An attacker could trick a user into clicking a link to submit a “color” that actually contains an XSS attack. If

that color isn’t escaped, the user could again inject malicious code into the user’s environment.

22.5.1 The Solution

There are a number of general principles that can protect you from these attacks:

• Never allow session information to be contained in the URL.

Django’s session framework (see Chapter 14) simply doesn’t allow sessions to be contained in the URL.

• Don’t store data in cookies directly. Instead, store a session ID that maps to session data stored on the backend.

If you use Django’s built-in session framework (i.e., request.session), this is handled automatically for

you. The only cookie that the session framework uses is a single session ID; all the session data is stored in the

database.

• Remember to escape session data if you display it in the template. See the earlier XSS section, and remem-

ber that it applies to any user-created content as well as any data from the browser. You should treat session

information as being user created.

• Prevent attackers from spooﬁng session IDs whenever possible.

Although it’s nearly impossible to detect someone who’s hijacked a session ID, Django does have built-in

protection against a brute-force session attack. Session IDs are stored as hashes (instead of sequential numbers),

which prevents a brute-force attack, and a user will always get a new session ID if she tries a nonexistent one,

which prevents session ﬁxation.

Notice that none of those principles and tools prevents man-in-the-middle attacks. These types of attacks are nearly

impossible to detect. If your site allows logged-in users to see any sort of sensitive data, you should always serve that

site over HTTPS. Additionally, if you have an SSL-enabled site, you should set the SESSION_COOKIE_SECURE

setting to True; this will make Django only send session cookies over HTTPS.

22.5. Session Forging/Hijacking 293

The Django Book, Release 2.0

22.6 E-mail Header Injection

SQL injection’s less well-known sibling, e-mail header injection, hijacks Web forms that send e-mail. An attacker can

use this technique to send spam via your mail server. Any form that constructs e-mail headers from Web form data is

vulnerable to this kind of attack.

Let’s look at the canonical contact form found on many sites. Usually this sends a message to a hard-coded e-mail

address and, hence, doesn’t appear vulnerable to spam abuse at ﬁrst glance.

However, most of these forms also allow the user to type in his own subject for the e-mail (along with a “from”

address, body, and sometimes a few other ﬁelds). This subject ﬁeld is used to construct the “subject” header of the

e-mail message.

If that header is unescaped when building the e-mail message, an attacker could submit something like

"hello\ncc:spamvictim@example.com" (where "\n” is a newline character). That would make the con-

structed e-mail headers turn into:

To: hardcoded@example.com

Subject: hello

cc: spamvictim@example.com

Like SQL injection, if we trust the subject line given by the user, we’ll allow him to construct a malicious set of

headers, and he can use our contact form to send spam.

22.6.1 The Solution

We can prevent this attack in the same way we prevent SQL injection: always escape or validate user-submitted

content.

Django’s built-in mail functions (in django.core.mail) simply do not allow newlines in any ﬁelds used to con-

struct headers (the from and to addresses, plus the subject). If you try to use django.core.mail.send_mail

with a subject that contains newlines, Django will raise a BadHeaderError exception.

If you do not use Django’s built-in mail functions to send e-mail, you’ll need to make sure that newlines in headers

either cause an error or are stripped. You may want to examine the SafeMIMEText class in django.core.mail

to see how Django does this.

22.7 Directory Traversal

Directory traversal is another injection-style attack, wherein a malicious user tricks ﬁlesystem code into reading and/or

writing ﬁles that the Web server shouldn’t have access to.

An example might be a view that reads ﬁles from the disk without carefully sanitizing the ﬁle name:

def dump_file(request):

filename =request.GET["filename"]

filename =os.path.join(BASE_PATH, filename)

content =open(filename).read()

# ...

Though it looks like that view restricts ﬁle access to ﬁles beneath BASE_PATH (by using os.path.join), if the

attacker passes in a filename containing .. (that’s two periods, a shorthand for “the parent directory”), she can

access ﬁles “above” BASE_PATH. It’s only a matter of time before she can discover the correct number of dots to

successfully access, say, ../../../../../etc/passwd.

294 Chapter 22. Chapter 20: Security

The Django Book, Release 2.0

Anything that reads ﬁles without proper escaping is vulnerable to this problem. Views that write ﬁles are just as

vulnerable, but the consequences are doubly dire.

Another permutation of this problem lies in code that dynamically loads modules based on the URL or other request

information. A well-publicized example came from the world of Ruby on Rails. Prior to mid-2006, Rails used URLs

like http://example.com/person/poke/1 directly to load modules and call methods. The result was that a

carefully constructed URL could automatically load arbitrary code, including a database reset script!

22.7.1 The Solution

If your code ever needs to read or write ﬁles based on user input, you need to sanitize the requested path very carefully

to ensure that an attacker isn’t able to escape from the base directory you’re restricting access to.

Note: Needless to say, you should never write code that can read from any area of the disk!

A good example of how to do this escaping lies in Django’s built-in static content-serving view (in

django.views.static). Here’s the relevant code:

import os

import posixpath

# ...

path =posixpath.normpath(urllib.unquote(path))

newpath =’’

for part in path.split(’/’):

if not part:

# strip empty path components

continue

drive, part =os.path.splitdrive(part)

head, part =os.path.split(part)

if part in (os.curdir, os.pardir):

# strip ’.’ and ’..’ in path

continue

newpath =os.path.join(newpath, part).replace(’\\’,’/’)

Django doesn’t read ﬁles (unless you use the static.serve function, but that’s protected with the code just shown),

so this vulnerability doesn’t affect the core code much.

In addition, the use of the URLconf abstraction means that Django will never load code you’ve not explicitly told it to

load. There’s no way to create a URL that causes Django to load something not mentioned in a URLconf.

22.8 Exposed Error Messages

During development, being able to see tracebacks and errors live in your browser is extremely useful. Django has

“pretty” and informative debug messages speciﬁcally to make debugging easier.

However, if these errors get displayed once the site goes live, they can reveal aspects of your code or conﬁguration

that could aid an attacker.

Furthermore, errors and tracebacks aren’t at all useful to end users. Django’s philosophy is that site visitors should

never see application-related error messages. If your code raises an unhandled exception, a site visitor should not see

22.8. Exposed Error Messages 295

The Django Book, Release 2.0

the full traceback – or any hint of code snippets or Python (programmer-oriented) error messages. Instead, the visitor

should see a friendly “This page is unavailable” message.

Naturally, of course, developers need to see tracebacks to debug problems in their code. So the framework should hide

all error messages from the public, but it should display them to the trusted site developers.

22.8.1 The Solution

As we covered in Chapter 12, Django’s DEBUG setting controls the display of these error messages. Make sure to set

this to False when you’re ready to deploy.

Users deploying under Apache and mod_python (also see Chapter 12) should also make sure they have

PythonDebug Off in their Apache conf ﬁles; this will suppress any errors that occur before Django has had a

chance to load.

22.9 A Final Word on Security

We hope all this talk of security problems isn’t too intimidating. It’s true that the Web can be a wild world, but with a

little bit of foresight, you can have a secure Web site.

Keep in mind that Web security is a constantly changing ﬁeld; if you’re reading the dead-tree version of this book,

be sure to check more up to date security resources for any new vulnerabilities that have been discovered. In fact,

it’s always a good idea to spend some time each week or month researching and keeping current on the state of Web

application security. It’s a small investment to make, but the protection you’ll get for your site and your users is

priceless.

22.10 What’s Next?

You’ve reached the end of our regularly scheduled program. The following appendixes all contain reference material

that you might need as you work on your Django projects.

We wish you the best of luck in running your Django site, whether it’s a little toy for you and a few friends, or the next

Google.

296 Chapter 22. Chapter 20: Security

CHAPTER 23

Appendix A: Model Deﬁnition Reference

Chapter 5 explains the basics of deﬁning models, and we use them throughout the rest of the book. There is, however,

ahuge range of model options available not covered elsewhere. This appendix explains each possible model deﬁnition

option.

Note that although these APIs are considered stable, the Django developers consistently add new shortcuts and

conveniences to the model deﬁnition. It’s a good idea to always check the latest documentation online at

http://docs.djangoproject.com/.

23.1 Fields

The most important part of a model – and the only required part of a model – is the list of database ﬁelds it deﬁnes.

Field Name Restrictions

Django places only two restrictions on model ﬁeld names:

1. A ﬁeld name cannot be a Python reserved word, because that would result in a Python syntax error. For example:

class Example(models.Model):

pass = models.IntegerField() # ’pass’ is a reserved word!

2. A ﬁeld name cannot contain more than one underscore in a row, due to the way Django’s query lookup syntax

works. For example:

class Example(models.Model):

foo__bar =models.IntegerField() # ’foo__bar’ has two underscores!

These limitations can be worked around, though, because your ﬁeld name doesn’t necessarily have to match your

database column name. See “db_column”, below.

SQL reserved words, such as join,where, or select,are allowed as model ﬁeld names, because Django escapes

all database table names and column names in every underlying SQL query. It uses the quoting syntax of your

particular database engine.

Each ﬁeld in your model should be an instance of the appropriate Field class. Django uses the ﬁeld class types to

determine a few things:

• The database column type (e.g., INTEGER,VARCHAR).

• The widget to use in Django’s forms and admin site, if you care to use it (e.g., <input type="text">,

<select>).

297

The Django Book, Release 2.0

• The minimal validation requirements, which are used in Django’s admin interface and by forms.

A complete list of ﬁeld classes follows, sorted alphabetically. Note that relationship ﬁelds (ForeignKey, etc.) are

handled in the next section.

23.1.1 AutoField

An IntegerField that automatically increments according to available IDs. You usually won’t need to use this

directly; a primary key ﬁeld will automatically be added to your model if you don’t specify otherwise.

23.1.2 BooleanField

A true/false ﬁeld.

MySQL users...

A boolean ﬁeld in MySQL is stored as a TINYINT column with a value of either 0 or 1 (most databases have a

proper BOOLEAN type instead). So, for MySQL, only, when a BooleanField is retrieved from the database and

stored on a model attribute, it will have the values 1 or 0, rather than True or False. Normally, this shouldn’t

be a problem, since Python guarantees that 1 == True and 0 == False are both true. Just be careful if you’re

writing something like obj is True when obj is a value from a boolean attribute on a model. If that model was

constructed using the mysql backend, the “is” test will fail. Prefer an equality test (using “==”) in cases like this.

23.1.3 CharField

A string ﬁeld, for small- to large-sized strings.

For very large amounts of text, use TextField.

CharField has one extra required argument: max_length. This is the maximum length (in characters) of the

ﬁeld. The max_length is enforced at the database level and in Django’s validation.

23.1.4 CommaSeparatedIntegerField

A ﬁeld of integers separated by commas. As in CharField, the max_length argument is required.

23.1.5 DateField

A date, represented in Python by a datetime.date instance.

23.1.6 DateTimeField

A date and time, represented in Python by a datetime.datetime instance.

23.1.7 DecimalField

A ﬁxed-precision decimal number, represented in Python by a decimal.Decimal instance. Has two required

arguments:

max_digits The maximum number of digits allowed in the number

298 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

decimal_places The number of decimal places to store with the number

For example, to store numbers up to 999 with a resolution of 2 decimal places, you’d use:

models.DecimalField(..., max_digits=5, decimal_places=2)

And to store numbers up to approximately one billion with a resolution of 10 decimal places:

models.DecimalField(..., max_digits=19, decimal_places=10)

When assigning to a DecimalField, use either a decimal.Decimal object or a string – not a Python ﬂoat.

23.1.8 EmailField

ACharField that checks that the value is a valid e-mail address.

23.1.9 FileField

A ﬁle-upload ﬁeld.

Note: The primary_key and unique arguments are not supported, and will raise a TypeError if used.

Has one required argument:

upload_to A local ﬁlesystem path that will be appended to your MEDIA_ROOT setting to determine the value of

the django.core.files.File.url attribute.

This path may contain “strftime formatting” (see the Python docs for the time standard library module), which

will be replaced using the date/time of the ﬁle upload (so that uploaded ﬁles don’t ﬁll up the given directory).

This may also be a callable, such as a function, which will be called to obtain the upload path, including the

ﬁlename. This callable must be able to accept two arguments, and return a Unix-style path (with forward slashes)

to be passed along to the storage system. The two arguments that will be passed are:

Argu-

ment

Description

instanceAn instance of the model where the FileField is deﬁned. More speciﬁcally, this is the

particular instance where the current ﬁle is being attached.

In most cases, this object will not have been saved to the database yet, so if it uses the default

AutoField,it might not yet have a value for its primary key ﬁeld.

filenameThe ﬁlename that was originally given to the ﬁle. This may or may not be taken into account

when determining the ﬁnal destination path.

Also has one optional argument:

storage Optional. A storage object, which handles the storage and retrieval of your ﬁles.

Using a FileField or an ImageField (see below) in a model takes a few steps:

1. In your settings ﬁle, you’ll need to deﬁne MEDIA_ROOT as the full path to a directory where you’d like Django

to store uploaded ﬁles. (For performance, these ﬁles are not stored in the database.) Deﬁne MEDIA_URL as the

base public URL of that directory. Make sure that this directory is writable by the Web server’s user account.

2. Add the FileField or ImageField to your model, making sure to deﬁne the upload_to option to tell

Django to which subdirectory of MEDIA_ROOT it should upload ﬁles.

3. All that will be stored in your database is a path to the ﬁle (relative to MEDIA_ROOT). You’ll most likely

want to use the convenience url function provided by Django. For example, if your ImageField is called

23.1. Fields 299

The Django Book, Release 2.0

mug_shot, you can get the absolute URL to your image in a template with {{ object.mug_shot.url

}}.

For example, say your MEDIA_ROOT is set to ’/home/media’, and upload_to is set to

’photos/%Y/%m/%d’. The ’%Y/%m/%d’ part of upload_to is strftime formatting; ’%Y’ is the four-

digit year, ’%m’ is the two-digit month and ’%d’ is the two-digit day. If you upload a ﬁle on Jan. 15, 2007, it will be

saved in the directory /home/media/photos/2007/01/15.

If you want to retrieve the upload ﬁle’s on-disk ﬁlename, or a URL that refers to that ﬁle, or the ﬁle’s size, you can use

the name,url and size attributes.

Note that whenever you deal with uploaded ﬁles, you should pay close attention to where you’re uploading them and

what type of ﬁles they are, to avoid security holes. Validate all uploaded ﬁles so that you’re sure the ﬁles are what you

think they are. For example, if you blindly let somebody upload ﬁles, without validation, to a directory that’s within

your Web server’s document root, then somebody could upload a CGI or PHP script and execute that script by visiting

its URL on your site. Don’t allow that.

By default, FileField instances are created as varchar(100) columns in your database. As with other ﬁelds,

you can change the maximum length using the max_length argument.

23.1.10 FilePathField

ACharField whose choices are limited to the ﬁlenames in a certain directory on the ﬁlesystem. Has three special

arguments, of which the ﬁrst is required:

path Required. The absolute ﬁlesystem path to a directory from which this FilePathField should get its

choices. Example: "/home/images".

match Optional. A regular expression, as a string, that FilePathField will use to ﬁlter ﬁlenames. Note that the

regex will be applied to the base ﬁlename, not the full path. Example: "foo.*\.txt$", which will match a

ﬁle called foo23.txt but not bar.txt or foo23.gif.

recursive Optional. Either True or False. Default is False. Speciﬁes whether all subdirectories of path

should be included.

Of course, these arguments can be used together.

The one potential gotcha is that match applies to the base ﬁlename, not the full path. So, this example:

FilePathField(path="/home/images", match="foo.*", recursive=True)

...will match /home/images/bar/foo.gif but not /home/images/foo/bar.gif because the match ap-

plies to the base ﬁlename (foo.gif and bar.gif).

By default, FilePathField instances are created as varchar(100) columns in your database. As with other

ﬁelds, you can change the maximum length using the max_length argument.

23.1.11 FloatField

A ﬂoating-point number represented in Python by a float instance.

23.1.12 ImageField

Like FileField, but validates that the uploaded object is a valid image. Has two extra optional arguments:

height_field Name of a model ﬁeld which will be auto-populated with the height of the image each time the

model instance is saved.

300 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

width_field Name of a model ﬁeld which will be auto-populated with the width of the image each time the model

instance is saved.

In addition to the special attributes that are available for FileField‘‘, an ImageField also has height and width

attributes, both of which correspond to the image’s height and width in pixels.

Requires the Python Imaging Library, available at http://www.pythonware.com/products/pil/.

By default, ImageField instances are created as varchar(100) columns in your database. As with other ﬁelds,

you can change the maximum length using the max_length argument.

23.1.13 IntegerField

An integer.

23.1.14 IPAddressField

An IP address, in string format (e.g. ’192.0.2.30’).

23.1.15 NullBooleanField

Like a BooleanField, but allows NULL as one of the options. Use this instead of a BooleanField with

null=True.

23.1.16 PositiveIntegerField

Like an IntegerField, but must be positive.

23.1.17 PositiveSmallIntegerField

Like a PositiveIntegerField, but only allows values under a certain (database-dependent) point.

23.1.18 SlugField

“Slug” is a newspaper term. A slug is a short label for something, containing only letters, numbers, underscores or

hyphens. They’re generally used in URLs.

Like a CharField, you can specify max_length. If max_length is not speciﬁed, Django will use a default

length of 50.

Implies setting db_index to True.

23.1.19 SmallIntegerField

Like an IntegerField, but only allows values under a certain (database-dependent) point.

23.1. Fields 301

The Django Book, Release 2.0

23.1.20 TextField

A large text ﬁeld.

Also see CharField for storing smaller bits of text.

23.1.21 TimeField

A time, represented in Python by a datetime.time instance. Accepts the same auto-population options as

DateField.

23.1.22 URLField

ACharField for a URL. Has one extra optional argument:

verify_exists If True (the default), the URL given will be checked for existence (i.e., the URL actually loads

and doesn’t give a 404 response). It should be noted that when using the single-threaded development server,

validating a url being served by the same server will hang. This should not be a problem for multithreaded

servers.

Like all CharField subclasses, URLField takes the optional max_length argument. If you don’t specify

max_length, a default of 200 is used.

23.1.23 XMLField

ATextField that checks that the value is valid XML that matches a given schema. Takes one required argument:

schema_path The ﬁlesystem path to a RelaxNG schema against which to validate the ﬁeld. For more on RelaxNG,

see http://www.relaxng.org/.

23.2 Universal Field Options

The following arguments are available to all ﬁeld types. All are optional.

23.2.1 null

If True, Django will store empty values as NULL in the database. If False, saving empty values will likely result in

a database error. Default is False.

Note that empty string values will always get stored as empty strings, not as NULL. Only use null=True for non-

string ﬁelds such as integers, booleans and dates. For both types of ﬁelds, you will also need to set blank=True if

you wish to permit empty values in forms, as the null parameter only affects database storage (see blank).

Avoid using null on string-based ﬁelds such as CharField and TextField unless you have an excellent reason.

If a string-based ﬁeld has null=True, that means it has two possible values for “no data”: NULL, and the empty

string. In most cases, it’s redundant to have two possible values for “no data;” Django’s convention is to use the empty

string, not NULL.

Note: When using the Oracle database backend, the null=True option will be coerced for string-based ﬁelds that

have the empty string as a possible value, and the value NULL will be stored to denote the empty string.

For more on this, see the section “Making Date and Numeric Fields Optional” in Chapter 6.

302 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

23.2.2 blank

If True, the ﬁeld is allowed to be blank. Default is False.

Note that this is different than null.null is purely database-related, whereas blank is validation-related. If

a ﬁeld has blank=True, validation on Django’s admin site will allow entry of an empty value. If a ﬁeld has

blank=False, the ﬁeld will be required.

23.2.3 choices

An iterable (e.g., a list or tuple) of 2-tuples to use as choices for this ﬁeld.

A choices list looks like this:

YEAR_IN_SCHOOL_CHOICES =(

(’FR’,’Freshman’),

(’SO’,’Sophomore’),

(’JR’,’Junior’),

(’SR’,’Senior’),

(’GR’,’Graduate’),

)

The ﬁrst element in each tuple is the actual value to be stored. The second element is the human-readable name for

the option.

The choices list can be deﬁned either as part of your model class:

class Foo(models.Model):

GENDER_CHOICES =(

(’M’,’Male’),

(’F’,’Female’),

)

gender =models.CharField(max_length=1, choices=GENDER_CHOICES)

or outside your model class altogether:

GENDER_CHOICES =(

(’M’,’Male’),

(’F’,’Female’),

)

class Foo(models.Model):

gender =models.CharField(max_length=1, choices=GENDER_CHOICES)

You can also collect your available choices into named groups that can be used for organizational purposes in a form:

MEDIA_CHOICES =(

(’Audio’, (

(’vinyl’,’Vinyl’),

(’cd’,’CD’),

)

(’Video’, (

(’vhs’,’VHS Tape’),

(’dvd’,’DVD’),

)

(’unknown’,’Unknown’),

)

23.2. Universal Field Options 303

The Django Book, Release 2.0

The ﬁrst element in each tuple is the name to apply to the group. The second element is an iterable of 2-tuples, with

each 2-tuple containing a value and a human-readable name for an option. Grouped options may be combined with

ungrouped options within a single list (such as the unknown option in this example).

Finally, note that choices can be any iterable object – not necessarily a list or tuple. This lets you construct choices

dynamically. But if you ﬁnd yourself hacking choices to be dynamic, you’re probably better off using a proper

database table with a ForeignKey‘.choices is meant for static data that doesn’t change much, if ever.

23.2.4 db_column

The name of the database column to use for this ﬁeld. If this isn’t given, Django will use the ﬁeld’s name.

If your database column name is an SQL reserved word, or contains characters that aren’t allowed in Python variable

names – notably, the hyphen – that’s OK. Django quotes column and table names behind the scenes.

23.2.5 db_index

If True,django-admin.py sqlindexes will output a CREATE INDEX statement for this ﬁeld.

23.2.6 db_tablespace

The name of the database tablespace to use for this ﬁeld’s index, if this ﬁeld is indexed. The default is the project’s

DEFAULT_INDEX_TABLESPACE setting, if set, or the db_tablespace of the model, if any. If the backend

doesn’t support tablespaces, this option is ignored.

23.2.7 default

The default value for the ﬁeld. This can be a value or a callable object. If callable it will be called every time a new

object is created.

23.2.8 editable

If False, the ﬁeld will not be editable in the admin or via forms automatically generated from the model class.

Default is True.

23.2.9 help_text

Extra “help” text to be displayed under the ﬁeld on the object’s admin form. It’s useful for documentation even if your

object doesn’t have an admin form.

Note that this value is not HTML-escaped when it’s displayed in the admin interface. This lets you include HTML in

help_text if you so desire. For example:

help_text="Please use the following format: <em>YYYY-MM-DD</em>."

Alternatively you can use plain text and django.utils.html.escape() to escape any HTML special charac-

ters.

304 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

23.2.10 primary_key

If True, this ﬁeld is the primary key for the model.

If you don’t specify primary_key=True for any ﬁelds in your model, Django will automatically add an

AutoField to hold the primary key, so you don’t need to set primary_key=True on any of your ﬁelds unless

you want to override the default primary-key behavior.

primary_key=True implies null=False and unique=True. Only one primary key is allowed on an object.

23.2.11 unique

If True, this ﬁeld must be unique throughout the table.

This is enforced at the database level and at the level of forms created with ModelForm (including forms in the

Django admin site). If you try to save a model with a duplicate value in a unique ﬁeld, an IntegrityError will

be raised by the model’s save method.

This option is valid on all ﬁeld types except ManyToManyField,FileField and ImageField.

23.2.12 unique_for_date

Set this to the name of a DateField or DateTimeField to require that this ﬁeld be unique for the value of the

date ﬁeld.

For example, if you have a ﬁeld title that has unique_for_date="pub_date", then Django wouldn’t allow

the entry of two records with the same title and pub_date.

This is enforced at the level of forms created with ModelForm (including forms in the Django admin site) but not at

the database level.

23.2.13 unique_for_month

Like unique_for_date, but requires the ﬁeld to be unique with respect to the month.

23.2.14 unique_for_year

Like unique_for_date and unique_for_month.

23.2.15 verbose_name

A human-readable name for the ﬁeld. If the verbose name isn’t given, Django will automatically create it using the

ﬁeld’s attribute name, converting underscores to spaces.

23.3 Relationships

Clearly, the power of relational databases lies in relating tables to each other. Django offers ways to deﬁne the three

most common types of database relationships: many-to-one, many-to-many, and one-to-one.

23.3. Relationships 305

The Django Book, Release 2.0

23.3.1 ForeignKey

A many-to-one relationship. Requires a positional argument: the class to which the model is related.

To create a recursive relationship – an object that has a many-to-one relationship with itself – use

models.ForeignKey(’self’).

If you need to create a relationship on a model that has not yet been deﬁned, you can use the name of the model, rather

than the model object itself:

class Car(models.Model):

manufacturer = models.ForeignKey(’Manufacturer’)

# ...

class Manufacturer(models.Model):

# ...

Note, however, that this only refers to models in the same models.py ﬁle.

To refer to models deﬁned in another application, you must instead explicitly specify the application label. For ex-

ample, if the Manufacturer model above is deﬁned in another application called production, you’d need to

use:

class Car(models.Model):

manufacturer =models.ForeignKey(’production.Manufacturer’)

Behind the scenes, Django appends "_id" to the ﬁeld name to create its database column name. In the above example,

the database table for the Car model will have a manufacturer_id column. (You can change this explicitly by

specifying db_column) However, your code should never have to deal with the database column name, unless you

write custom SQL. You’ll always deal with the ﬁeld names of your model object.

ForeignKey accepts an extra set of arguments – all optional – that deﬁne the details of how the relation works.

limit_choices_to A dictionary of lookup arguments and values that limit the available admin choices for this

object. Use this with functions from the Python datetime module to limit choices of objects by date. For

example:

limit_choices_to ={’pub_date__lte’: datetime.now}

only allows the choice of related objects with a pub_date before the current date/time to be chosen.

limit_choices_to has no effect on the inline FormSets that are created to display related objects in the

admin.

related_name The name to use for the relation from the related object back to this one.

to_field The ﬁeld on the related object that the relation is to. By default, Django uses the primary key of the

related object.

23.3.2 ManyToManyField

A many-to-many relationship. Requires a positional argument: the class to which the model is related. This works

exactly the same as it does for ForeignKey, including all the options regarding recursive relationships and lazy

relationships.

Behind the scenes, Django creates an intermediary join table to represent the many-to-many relationship. By default,

this table name is generated using the names of the two tables being joined. Since some databases don’t support table

names above a certain length, these table names will be automatically truncated to 64 characters and a uniqueness hash

will be used. This means you might see table names like author_books_9cdf4; this is perfectly normal. You can

manually provide the name of the join table using the db_table option.

306 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

ManyToManyField accepts an extra set of arguments – all optional – that control how the relationship functions.

related_name Same as related_name in ForeignKey.

limit_choices_to Same as limit_choices_to in ForeignKey.

limit_choices_to has no effect when used on a ManyToManyField with a custom intermediate table

speciﬁed using the through paramter.

symmetrical Only used in the deﬁnition of ManyToManyFields on self. Consider the following model:

class Person(models.Model):

friends =models.ManyToManyField("self")

When Django processes this model, it identiﬁes that it has a ManyToManyField on itself, and as a result, it

doesn’t add a person_set attribute to the Person class. Instead, the ManyToManyField is assumed to

be symmetrical – that is, if I am your friend, then you are my friend.

If you do not want symmetry in many-to-many relationships with self, set symmetrical to False. This

will force Django to add the descriptor for the reverse relationship, allowing ManyToManyField relationships

to be non-symmetrical.

through Django will automatically generate a table to manage many-to-many relationships. However, if you want

to manually specify the intermediary table, you can use the through option to specify the Django model that

represents the intermediate table that you want to use.

The most common use for this option is when you want to associate extra data with a many-to-many relationship.

db_table The name of the table to create for storing the many-to-many data. If this is not provided, Django will

assume a default name based upon the names of the two tables being joined.

23.3.3 OneToOneField

A one-to-one relationship. Conceptually, this is similar to a ForeignKey with unique=True, but the “reverse”

side of the relation will directly return a single object.

This is most useful as the primary key of a model which “extends” another model in some way; multi-table-inheritance

is implemented by adding an implicit one-to-one relation from the child model to the parent model, for example.

One positional argument is required: the class to which the model will be related. This works exactly the same as it

does for ForeignKey, including all the options regarding recursive relationships and lazy relationships.

Additionally, OneToOneField accepts all of the extra arguments accepted by ForeignKey, plus one extra argu-

ment:

parent_link When True and used in a model which inherits from another (concrete) model, indicates that this

ﬁeld should be used as the link back to the parent class, rather than the extra OneToOneField which would

normally be implicitly created by subclassing.

23.4 Model Metadata Options

Model-speciﬁc metadata lives in a class Meta deﬁned in the body of your model class:

class Book(models.Model):

title =models.CharField(maxlength=100)

class Meta:

# model metadata options go here

...

23.4. Model Metadata Options 307

The Django Book, Release 2.0

Model metadata is “anything that’s not a ﬁeld,” such as ordering options and so forth.

The sections that follow present a list of all possible Meta options. No options are required. Adding class Meta

to a model is completely optional.

23.4.1 abstract

If True, this model will be an abstract base class. See the Django documentation for more on abstract base classes.

23.4.2 db_table

The name of the database table to use for the model:

db_table =’music_album’

Table names

To save you time, Django automatically derives the name of the database table from the name of your model class and

the app that contains it. A model’s database table name is constructed by joining the model’s “app label” – the name

you used in manage.py startapp – to the model’s class name, with an underscore between them.

For example, if you have an app bookstore (as created by manage.py startapp bookstore), a model

deﬁned as class Book will have a database table named bookstore_book.

To override the database table name, use the db_table parameter in class Meta.

If your database table name is an SQL reserved word, or contains characters that aren’t allowed in Python variable

names – notably, the hyphen – that’s OK. Django quotes column and table names behind the scenes.

23.4.3 db_tablespace

The name of the database tablespace to use for the model. If the backend doesn’t support tablespaces, this option is

ignored.

23.4.4 get_latest_by

The name of a DateField or DateTimeField in the model. This speciﬁes the default ﬁeld to use in your model

Manager‘s latest method.

Example:

get_latest_by ="order_date"

23.4.5 managed

Defaults to True, meaning Django will create the appropriate database tables in django-admin.py syncdb and

remove them as part of a reset management command. That is, Django manages the database tables’ lifecycles.

If False, no database table creation or deletion operations will be performed for this model. This is useful if the

model represents an existing table or a database view that has been created by some other means. This is the only

difference when managed is False. All other aspects of model handling are exactly the same as normal. This

includes

308 Chapter 23. Appendix A: Model Deﬁnition Reference

The Django Book, Release 2.0

1. Adding an automatic primary key ﬁeld to the model if you don’t declare it. To avoid confusion for later code

readers, it’s recommended to specify all the columns from the database table you are modeling when using

unmanaged models.

2. If a model with managed=False contains a ManyToManyField that points to another unmanaged model,

then the intermediary table for the many-to-many join will also not be created. However, the intermediary table

between one managed and one unmanaged model will be created.

If you need to change this default behavior, create the intermediary table as an explicit model (with managed

set as needed) and use the through attribute to make the relation use your custom model.

For tests involving models with managed=False, it’s up to you to ensure the correct tables are created as part of the

test setup.

If you’re interested in changing the Python-level behavior of a model class, you could use managed=False and

create a copy of an existing model. However, there’s a better approach for that situation: proxy-models.

23.4.6 ordering

The default ordering for the object, for use when obtaining lists of objects:

ordering =[’-order_date’]

This is a tuple or list of strings. Each string is a ﬁeld name with an optional “-” preﬁx, which indicates descending

order. Fields without a leading “-” will be ordered ascending. Use the string ”?” to order randomly.

Note: Regardless of how many ﬁelds are in ordering, the admin site uses only the ﬁrst ﬁeld.

For example, to order by a pub_date ﬁeld ascending, use this:

ordering =[’pub_date’]

To order by pub_date descending, use this:

ordering =[’-pub_date’]

To order by pub_date descending, then by author ascending, use this:

ordering =[’-pub_date’,’author’]

23.4.7 proxy

If set to True, a model which subclasses another model will be treated as a proxy model. For more on proxy models,

see the Django documentation.

23.4.8 unique_together

Sets of ﬁeld names that, taken together, must be unique:

unique_together =(("driver","restaurant"),)

This is a list of lists of ﬁelds that must be unique when considered together. It’s used by ModelForm forms (including

forms in the Django admin site) and is enforced at the database level (i.e., the appropriate UNIQUE statements are

included in the CREATE TABLE statement).

For convenience, unique_together can be a single sequence when dealing with a single set of ﬁelds:

23.4. Model Metadata Options 309

The Django Book, Release 2.0

unique_together =("driver","restaurant")

23.4.9 verbose_name

A human-readable name for the object, singular:

verbose_name ="pizza"

If this isn’t given, Django will use a munged version of the class name: CamelCase becomes camel case.

23.4.10 verbose_name_plural

The plural name for the object:

verbose_name_plural ="stories"

If this isn’t given, Django will use verbose_name +"s".

310 Chapter 23. Appendix A: Model Deﬁnition Reference

CHAPTER 24

Appendix B: Database API Reference

Django’s database API is the other half of the model API discussed in Appendix A. Once you’ve deﬁned a model,

you’ll use this API any time you need to access the database. You’ve seen examples of this API in use throughout the

book; this appendix explains all the various options in detail.

Like the model APIs discussed in Appendix A, though these APIs are considered very stable, the Django developers

consistently add new shortcuts and conveniences. It’s a good idea to always check the latest documentation online,

available at http://docs.djangoproject.com/.

Throughout this reference, we’ll refer to the following models, which might form a simple blog application:

from django.db import models

class Blog(models.Model):

name =models.CharField(max_length=100)

tagline =models.TextField()

def __unicode__(self):

return self.name

class Author(models.Model):

name =models.CharField(max_length=50)

email =models.EmailField()

def __unicode__(self):

return self.name

class Entry(models.Model):

blog =models.ForeignKey(Blog)

headline =models.CharField(max_length=255)

body_text =models.TextField()

pub_date =models.DateTimeField()

authors =models.ManyToManyField(Author)

def __unicode__(self):

return self.headline

24.1 Creating Objects

To create an object, instantiate it using keyword arguments to the model class, and then call save() to save it to the

database:

311

The Django Book, Release 2.0

>>> from mysite.blog.models import Blog

>>> b=Blog(name=’Beatles Blog’, tagline=’All the latest Beatles news.’)

>>> b.save()

This performs an INSERT SQL statement behind the scenes. Django doesn’t hit the database until you explicitly call

save().

The save() method has no return value.

To create an object and save it all in one step, see the create manager method.

24.1.1 What Happens When You Save?

When you save an object, Django performs the following steps:

1. Emit a pre_save signal. This provides a notiﬁcation that an object is about to be saved. You can register

a listener that will be invoked whenever this signal is emitted. Check the online documentation for more on

signals.

2. Preprocess the data. Each ﬁeld on the object is asked to perform any automated data modiﬁcation that the ﬁeld

may need to perform.

Most ﬁelds do no preprocessing – the ﬁeld data is kept as is. Preprocessing is only used on ﬁelds that have

special behavior, like ﬁle ﬁelds.

3. Prepare the data for the database. Each ﬁeld is asked to provide its current value in a data type that can be

written to the database.

Most ﬁelds require no data preparation. Simple data types, such as integers and strings, are “ready to write”

as a Python object. However, more complex data types often require some modiﬁcation. For example,

DateFields use a Python datetime object to store data. Databases don’t store datetime objects, so

the ﬁeld value must be converted into an ISO-compliant date string for insertion into the database.

4. Insert the data into the database. The preprocessed, prepared data is then composed into an SQL statement

for insertion into the database.

5. Emit a post_save signal. As with the pre_save signal, this is used to provide notiﬁcation that an object has

been successfully saved.

24.1.2 Autoincrementing Primary Keys

For convenience, each model is given an autoincrementing primary key ﬁeld named id unless you explicitly specify

primary_key=True on a ﬁeld (see the section titled “AutoField” in Appendix A).

If your model has an AutoField, that autoincremented value will be calculated and saved as an attribute on your

object the ﬁrst time you call save():

>>> b2 =Blog(name=’Cheddar Talk’, tagline=’Thoughts on cheese.’)

>>> b2.id # Returns None, because b doesn’t have an ID yet.

None

>>> b2.save()

>>> b2.id # Returns the ID of your new object.

There’s no way to tell what the value of an ID will be before you call save(), because that value is calculated by

your database, not by Django.

312 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

If a model has an AutoField but you want to deﬁne a new object’s ID explicitly when saving, just deﬁne it explicitly

before saving, rather than relying on the autoassignment of the ID:

>>> b3 =Blog(id=3, name=’Cheddar Talk’, tagline=’Thoughts on cheese.’)

>>> b3.id

>>> b3.save()

>>> b3.id

If you assign auto-primary-key values manually, make sure not to use an already existing primary key value! If you

create a new object with an explicit primary key value that already exists in the database, Django will assume you’re

changing the existing record rather than creating a new one.

Given the preceding ’Cheddar Talk’ blog example, this example would override the previous record in the

database:

>>> b4 =Blog(id=3, name=’Not Cheddar’, tagline=’Anything but cheese.’)

>>> b4.save() # Overrides the previous blog with ID=3!

Explicitly specifying auto-primary-key values is mostly useful for bulk-saving objects, when you’re conﬁdent you

won’t have primary key collision.

24.2 Saving Changes to Objects

To save changes to an object that’s already in the database, use save().

Given a Blog instance b5 that has already been saved to the database, this example changes its name and updates its

record in the database:

>>> b5.name =’New name’

>>> b5.save()

This performs an UPDATE SQL statement behind the scenes. Again, Django doesn’t hit the database until you explic-

itly call save().

How Django Knows When to UPDATE and When to INSERT

You may have noticed that Django database objects use the same save() method for creating and changing objects.

Django abstracts the need to use INSERT or UPDATE SQL statements. Speciﬁcally, when you call save(), Django

follows this algorithm:

• If the object’s primary key attribute is set to a value that evaluates to True (i.e., a value other than None or

the empty string), Django executes a SELECT query to determine whether a record with the given primary key

already exists.

• If the record with the given primary key does already exist, Django executes an UPDATE query.

• If the object’s primary key attribute is not set, or if it’s set but a record doesn’t exist, Django executes an

INSERT.

Because of this, you should be careful not to specify a primary key value explicitly when saving new objects if you

cannot guarantee the primary key value is unused.

Updating ForeignKey ﬁelds works exactly the same way; simply assign an object of the right type to the ﬁeld in

question:

24.2. Saving Changes to Objects 313

The Django Book, Release 2.0

>>> joe =Author.objects.create(name="Joe")

>>> entry.author =joe

>>> entry.save()

Django will complain if you try to assign an object of the wrong type.

24.3 Retrieving Objects

Throughout the book you’ve seen objects retrieved using code like the following:

>>> blogs =Blog.objects.filter(author__name__contains="Joe")

There are quite a few “moving parts” behind the scenes here: when you retrieve objects from the database, you’re

actually constructing a QuerySet using the model’s Manager. This QuerySet knows how to execute SQL and

return the requested objects.

Appendix A looked at both of these objects from a model-deﬁnition point of view; now we’ll look at how they operate.

AQuerySet represents a collection of objects from your database. It can have zero, one, or many ﬁlters – criteria that

narrow down the collection based on given parameters. In SQL terms, a QuerySet equates to a SELECT statement,

and a ﬁlter is a WHERE.

You get a QuerySet by using your model’s Manager. Each model has at least one Manager, and it’s called

objects by default. Access it directly via the model class, like so:

>>> Blog.objects

<django.db.models.manager.Manager object at 0x137d00d>

Managers are accessible only via model classes, rather than from model instances, to enforce a separation between

“table-level” operations and “record-level” operations:

>>> b=Blog(name=’Foo’, tagline=’Bar’)

>>> b.objects

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

AttributeError: Manager isn’t accessible via Blog instances.

The Manager is the main source of QuerySets for a model. It acts as a “root” QuerySet that describes all objects

in the model’s database table. For example, Blog.objects is the initial QuerySet that contains all Blog objects

in the database.

24.4 Caching and QuerySets

Each QuerySet contains a cache, to minimize database access. It’s important to understand how it works, in order

to write the most efﬁcient code.

In a newly created QuerySet, the cache is empty. The ﬁrst time a QuerySet is evaluated – and, hence, a database

query happens – Django saves the query results in the QuerySet‘s cache and returns the results that have been

explicitly requested (e.g., the next element, if the QuerySet is being iterated over). Subsequent evaluations of the

QuerySet reuse the cached results.

Keep this caching behavior in mind, because it may bite you if you don’t use your QuerySets correctly. For example,

the following will create two QuerySets, evaluate them, and throw them away:

314 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

print [e.headline for ein Entry.objects.all()]

print [e.pub_date for ein Entry.objects.all()]

That means the same database query will be executed twice, effectively doubling your database load. Also, there’s

a possibility the two lists may not include the same database records, because an Entry may have been added or

deleted in the split second between the two requests.

To avoid this problem, simply save the QuerySet and reuse it:

queryset =Poll.objects.all()

print [p.headline for pin queryset] # Evaluate the query set.

print [p.pub_date for pin queryset] # Reuse the cache from the evaluation.

24.5 Filtering Objects

The simplest way to retrieve objects from a table is to get all of them. To do this, use the all() method on a

Manager:

>>> Entry.objects.all()

The all() method returns a QuerySet of all the objects in the database.

Usually, though, you’ll need to select only a subset of the complete set of objects. To create such a subset, you reﬁne

the initial QuerySet, adding ﬁlter conditions. You’ll usually do this using the filter() and/or exclude()

methods:

>>> y2006 =Entry.objects.filter(pub_date__year=2006)

>>> not2006 =Entry.objects.exclude(pub_date__year=2006)

filter() and exclude() both take ﬁeld lookup arguments, which are discussed in detail shortly.

24.5.1 Chaining Filters

The result of reﬁning a QuerySet is itself a QuerySet, so it’s possible to chain reﬁnements together, for example:

>>> qs =Entry.objects.filter(headline__startswith=’What’)

>>> qs =qs.exclude(pub_date__gte=datetime.datetime.now())

>>> qs =qs.filter(pub_date__gte=datetime.datetime(2005,1,1))

This takes the initial QuerySet of all entries in the database, adds a ﬁlter, then an exclusion, and then another ﬁlter.

The ﬁnal result is a QuerySet containing all entries with a headline that starts with “What” that were published

between January 1, 2005, and the current day.

It’s important to point out here that QuerySets are lazy – the act of creating a QuerySet doesn’t involve any

database activity. In fact, the three preceding lines don’t make any database calls; you can chain ﬁlters together all day

long and Django won’t actually run the query until the QuerySet is evaluated.

You can evaluate a QuerySet in any following ways:

•Iterating: A QuerySet is iterable, and it executes its database query the ﬁrst time you iterate over it. For

example, the following QuerySet isn’t evaluated until it’s iterated over in the for loop:

qs =Entry.objects.filter(pub_date__year=2006)

qs =qs.filter(headline__icontains="bill")

for ein qs:

print e.headline

24.5. Filtering Objects 315

The Django Book, Release 2.0

This prints all headlines from 2006 that contain “bill” but causes only one database hit.

•Printing it: A QuerySet is evaluated when you call repr() on it. This is for convenience in the Python

interactive interpreter, so you can immediately see your results when using the API interactively.

•Slicing: As explained in the upcoming “Limiting QuerySets” section, a QuerySet can be sliced using Python’s

array-slicing syntax. Usually slicing a QuerySet returns another (unevaluated)‘‘QuerySet‘‘, but Django will

execute the database query if you use the “step” parameter of slice syntax.

•Converting to a list: You can force evaluation of a QuerySet by calling list() on it, for example:

>>> entry_list =list(Entry.objects.all())

Be warned, though, that this could have a large memory overhead, because Django will load each element of

the list into memory. In contrast, iterating over a QuerySet will take advantage of your database to load data

and instantiate objects only as you need them.

Filtered QuerySets Are Unique

Each time you reﬁne a QuerySet, you get a brand-new QuerySet that is in no way bound to the previous

QuerySet. Each reﬁnement creates a separate and distinct QuerySet that can be stored, used, and reused:

q1 =Entry.objects.filter(headline__startswith="What")

q2 =q1.exclude(pub_date__gte=datetime.now())

q3 =q1.filter(pub_date__gte=datetime.now())

These three QuerySets are separate. The ﬁrst is a base QuerySet containing all entries that contain a headline

starting with “What”. The second is a subset of the ﬁrst, with an additional criterion that excludes records whose

pub_date is greater than now. The third is a subset of the ﬁrst, with an additional criterion that selects only the

records whose pub_date is greater than now. The initial QuerySet (q1) is unaffected by the reﬁnement process.

24.5.2 Limiting QuerySets

Use Python’s array-slicing syntax to limit your QuerySet to a certain number of results. This is the equivalent of

SQL’s LIMIT and OFFSET clauses.

For example, this returns the ﬁrst ﬁve entries (LIMIT 5):

>>> Entry.objects.all()[:5]

This returns the sixth through tenth entries (OFFSET 5 LIMIT 5):

>>> Entry.objects.all()[5:10]

Generally, slicing a QuerySet returns a new QuerySet – it doesn’t evaluate the query. An exception is if you use

the “step” parameter of Python slice syntax. For example, this would actually execute the query in order to return a

list of every second object of the ﬁrst ten:

>>> Entry.objects.all()[:10:2]

To retrieve a single object rather than a list (e.g., SELECT foo FROM bar LIMIT 1), use a simple index instead

of a slice. For example, this returns the ﬁrst Entry in the database, after ordering entries alphabetically by headline:

>>> Entry.objects.order_by(’headline’)[0]

This is roughly equivalent to the following:

>>> Entry.objects.order_by(’headline’)[0:1].get()

316 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

Note, however, that the ﬁrst of these will raise IndexError while the second will raise DoesNotExist if no

objects match the given criteria.

24.5.3 Query Methods That Return New QuerySets

Django provides a range of QuerySet reﬁnement methods that modify either the types of results returned by the

QuerySet or the way its SQL query is executed. These methods are described in the sections that follow. Some of

the methods take ﬁeld lookup arguments, which are discussed in detail a bit later on.

ﬁlter(**lookup)

Returns a new QuerySet containing objects that match the given lookup parameters.

exclude(**lookup)

Returns a new QuerySet containing objects that do not match the given lookup parameters.

order_by(*ﬁelds)

By default, results returned by a QuerySet are ordered by the ordering tuple given by the ordering option in the

model’s metadata (see Appendix A). You can override this for a particular query using the order_by() method:

>> Entry.objects.filter(pub_date__year=2005).order_by(’-pub_date’, ’headline’)

This result will be ordered by pub_date descending, then by headline ascending. The negative sign in front of

"-pub_date" indicates descending order. Ascending order is assumed if the -is absent. To order randomly, use

"?", like so:

>>> Entry.objects.order_by(’?’)

Ordering randomly incurs a performance penalty, though, so you shouldn’t use it for anything with heavy load.

If no ordering is speciﬁed in a model’s class Meta and a QuerySet from that model doesn’t include

order_by(), then ordering will be undeﬁned and may differ from query to query.

distinct()

Returns a new QuerySet that uses SELECT DISTINCT in its SQL query. This eliminates duplicate rows from the

query results.

By default, a QuerySet will not eliminate duplicate rows. In practice, this is rarely a problem, because simple

queries such as Blog.objects.all() don’t introduce the possibility of duplicate result rows.

However, if your query spans multiple tables, it’s possible to get duplicate results when a QuerySet is evaluated.

That’s when you’d use distinct().

values(*ﬁelds)

Returns a special QuerySet that evaluates to a list of dictionaries instead of model-instance objects. Each of those

dictionaries represents an object, with the keys corresponding to the attribute names of model objects:

24.5. Filtering Objects 317

The Django Book, Release 2.0

# This list contains a Blog object.

>>> Blog.objects.filter(name__startswith=’Beatles’)

[Beatles Blog]

# This list contains a dictionary.

>>> Blog.objects.filter(name__startswith=’Beatles’).values()

[{’id’: 1, ’name’: ’Beatles Blog’, ’tagline’: ’All the latest Beatles news.’}]

values() takes optional positional arguments, *fields, which specify ﬁeld names to which the SELECT should

be limited. If you specify the ﬁelds, each dictionary will contain only the ﬁeld keys/values for the ﬁelds you specify.

If you don’t specify the ﬁelds, each dictionary will contain a key and value for every ﬁeld in the database table:

>>> Blog.objects.values()

[{’id’: 1, ’name’: ’Beatles Blog’, ’tagline’: ’All the latest Beatles news.’}],

>>> Blog.objects.values(’id’,’name’)

[{’id’: 1, ’name’: ’Beatles Blog’}]

This method is useful when you know you’re only going to need values from a small number of the available ﬁelds

and you won’t need the functionality of a model instance object. It’s more efﬁcient to select only the ﬁelds you need

to use.

dates(ﬁeld, kind, order)

Returns a special QuerySet that evaluates to a list of datetime.datetime objects representing all available

dates of a particular kind within the contents of the QuerySet.

The field argument must be the name of a DateField or DateTimeField of your model. The kind argument

must be either "year","month", or "day". Each datetime.datetime object in the result list is “truncated”

to the given type:

•"year" returns a list of all distinct year values for the ﬁeld.

•"month" returns a list of all distinct year/month values for the ﬁeld.

•"day" returns a list of all distinct year/month/day values for the ﬁeld.

order, which defaults to ’ASC’, should be either ’ASC’ or ’DESC’. This speciﬁes how to order the results.

Here are a few examples:

>>> Entry.objects.dates(’pub_date’,’year’)

[datetime.datetime(2005, 1, 1)]

>>> Entry.objects.dates(’pub_date’,’month’)

[datetime.datetime(2005, 2, 1), datetime.datetime(2005, 3, 1)]

>>> Entry.objects.dates(’pub_date’,’day’)

[datetime.datetime(2005, 2, 20), datetime.datetime(2005, 3, 20)]

>>> Entry.objects.dates(’pub_date’,’day’, order=’DESC’)

[datetime.datetime(2005, 3, 20), datetime.datetime(2005, 2, 20)]

>>> Entry.objects.filter(headline__contains=’Lennon’).dates(’pub_date’,’day’)

[datetime.datetime(2005, 3, 20)]

318 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

select_related()

Returns a QuerySet that will automatically “follow” foreign key relationships, selecting that additional related-

object data when it executes its query. This is a performance booster that results in (sometimes much) larger queries

but means later use of foreign key relationships won’t require database queries.

The following examples illustrate the difference between plain lookups and select_related() lookups. Here’s

standard lookup:

# Hits the database.

>>> e = Entry.objects.get(id=5)

# Hits the database again to get the related Blog object.

>>> b = e.blog

And here’s select_related lookup:

# Hits the database.

>>> e = Entry.objects.select_related().get(id=5)

# Doesn’t hit the database, because e.blog has been prepopulated

# in the previous query.

>>> b = e.blog

select_related() follows foreign keys as far as possible. If you have the following models:

class City(models.Model):

# ...

class Person(models.Model):

# ...

hometown = models.ForeignKey(City)

class Book(models.Model):

# ...

author = models.ForeignKey(Person)

then a call to Book.objects.select_related().get(id=4) will cache the related Person and the related

City:

>>> b=Book.objects.select_related().get(id=4)

>>> p=b.author # Doesn’t hit the database.

>>> c=p.hometown # Doesn’t hit the database.

>>> b=Book.objects.get(id=4)# No select_related() in this example.

>>> p=b.author # Hits the database.

>>> c=p.hometown # Hits the database.

Note that select_related() does not follow foreign keys that have null=True.

Usually, using select_related() can vastly improve performance because your application can avoid many

database calls. However, in situations with deeply nested sets of relationships, select_related() can sometimes

end up following “too many” relations and can generate queries so large that they end up being slow.

24.5.4 QuerySet Methods That Do Not Return QuerySets

The following QuerySet methods evaluate the QuerySet and return something other than aQuerySet – a single

object, value, and so forth.

24.5. Filtering Objects 319

The Django Book, Release 2.0

get(**lookup)

Returns the object matching the given lookup parameters, which should be in the format described in the “Field

Lookups” section. This raises AssertionError if more than one object was found.

get() raises a DoesNotExist exception if an object wasn’t found for the given parameters. The DoesNotExist

exception is an attribute of the model class, for example:

>>> Entry.objects.get(id=’foo’)# raises Entry.DoesNotExist

The DoesNotExist exception inherits from django.core.exceptions.ObjectDoesNotExist, so you

can target multiple DoesNotExist exceptions:

>>> from django.core.exceptions import ObjectDoesNotExist

>>> try:

... e=Entry.objects.get(id=3)

... b=Blog.objects.get(id=1)

... except ObjectDoesNotExist:

... print "Either the entry or blog doesn’t exist."

create(**kwargs)

This is a convenience method for creating an object and saving it all in one step. It lets you compress two common

steps:

>>> p=Person(first_name="Bruce", last_name="Springsteen")

>>> p.save()

into a single line:

>>> p=Person.objects.create(first_name="Bruce", last_name="Springsteen")

get_or_create(**kwargs)

This is a convenience method for looking up an object and creating one if it doesn’t exist. It returns a tuple of

(object, created), where object is the retrieved or created object and created is a Boolean specifying

whether a new object was created.

This method is meant as a shortcut to boilerplate code and is mostly useful for data-import scripts. For example:

try:

obj =Person.objects.get(first_name=’John’, last_name=’Lennon’)

except Person.DoesNotExist:

obj =Person(first_name=’John’, last_name=’Lennon’, birthday=date(1940,10,9))

obj.save()

This pattern gets quite unwieldy as the number of ﬁelds in a model increases. The previous example can be rewritten

using get_or_create() like so:

obj, created =Person.objects.get_or_create(

first_name =’John’,

last_name =’Lennon’,

defaults ={’birthday’: date(1940,10,9)}

)

Any keyword arguments passed to get_or_create() –except an optional one called defaults – will be used

in a get() call. If an object is found, get_or_create() returns a tuple of that object and False. If an object is

320 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

not found, get_or_create() will instantiate and save a new object, returning a tuple of the new object and True.

The new object will be created according to this algorithm:

defaults =kwargs.pop(’defaults’, {})

params =dict([(k, v) for k, v in kwargs.items() if ’__’ not in k])

params.update(defaults)

obj =self.model(**params)

obj.save()

In English, that means start with any non-’defaults’ keyword argument that doesn’t contain a double underscore

(which would indicate a nonexact lookup). Then add the contents of defaults, overriding any keys if necessary,

and use the result as the keyword arguments to the model class.

If you have a ﬁeld named defaults and want to use it as an exact lookup in get_or_create(), just use

’defaults__exact’ like so:

Foo.objects.get_or_create(

defaults__exact =’bar’,

defaults={’defaults’:’bar’}

)

Note: As mentioned earlier, get_or_create() is mostly useful in scripts that need to parse data and create new

records if existing ones aren’t available. But if you need to use get_or_create() in a view, please make sure to

use it only in POST requests unless you have a good reason not to. GET requests shouldn’t have any effect on data;

use POST whenever a request to a page has a side effect on your data.

count()

Returns an integer representing the number of objects in the database matching the QuerySet.count() never

raises exceptions. Here’s an example:

# Returns the total number of entries in the database.

>>> Entry.objects.count()

# Returns the number of entries whose headline contains ’Lennon’

>>> Entry.objects.filter(headline__contains=’Lennon’).count()

count() performs a SELECT COUNT(*)behind the scenes, so you should always use count() rather than

loading all of the records into Python objects and calling len() on the result.

Depending on which database you’re using (e.g., PostgreSQL or MySQL), count() may return a long integer instead

of a normal Python integer. This is an underlying implementation quirk that shouldn’t pose any real-world problems.

in_bulk(id_list)

Takes a list of primary key values and returns a dictionary mapping each primary key value to an instance of the object

with the given ID, for example:

>>> Blog.objects.in_bulk([1])

{1: Beatles Blog}

>>> Blog.objects.in_bulk([1,2])

{1: Beatles Blog, 2: Cheddar Talk}

>>> Blog.objects.in_bulk([])

{}

24.5. Filtering Objects 321

The Django Book, Release 2.0

IDs of objects that don’t exist are silently dropped from the result dictionary. If you pass in_bulk() an empty list,

you’ll get an empty dictionary.

latest(ﬁeld_name=None)

Returns the latest object in the table, by date, using the field_name provided as the date ﬁeld. This example returns

the latest Entry in the table, according to the pub_date ﬁeld:

>>> Entry.objects.latest(’pub_date’)

If your model’s Meta speciﬁes get_latest_by, you can leave off the field_name argument to latest().

Django will use the ﬁeld speciﬁed in get_latest_by by default.

Like get(),latest() raises DoesNotExist if an object doesn’t exist with the given parameters.

24.6 Field Lookups

Field lookups are how you specify the meat of an SQL WHERE clause. They’re speciﬁed as keyword arguments to the

QuerySet methods filter(),exclude(), and get().

Basic lookup keyword arguments take the form field__lookuptype=value (note the double underscore). For

example:

>>> Entry.objects.filter(pub_date__lte=’2006-01-01’)

translates (roughly) into the following SQL:

SELECT *FROM blog_entry WHERE pub_date <= ’2006-01-01’;

If you pass an invalid keyword argument, a lookup function will raise TypeError.

The supported lookup types follow.

24.6.1 exact

Performs an exact match:

>>> Entry.objects.get(headline__exact="Man bites dog")

This matches any object with the exact headline “Man bites dog”.

If you don’t provide a lookup type – that is, if your keyword argument doesn’t contain a double underscore – the

lookup type is assumed to be exact.

For example, the following two statements are equivalent:

>>> Blog.objects.get(id__exact=14)# Explicit form

>>> Blog.objects.get(id=14)# __exact is implied

This is for convenience, because exact lookups are the common case.

24.6.2 iexact

Performs a case-insensitive exact match:

322 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

>>> Blog.objects.get(name__iexact=’beatles blog’)

This will match ’Beatles Blog’,’beatles blog’,’BeAtLes BLoG’, and so forth.

24.6.3 contains

Performs a case-sensitive containment test:

Entry.objects.get(headline__contains=’Lennon’)

This will match the headline ’Today Lennon honored’ but not ’today lennon honored’.

SQLite doesn’t support case-sensitive LIKE statements; when using SQLite,‘‘contains‘‘ acts like icontains.

Escaping Percent Signs and Underscores in LIKE Statements

The ﬁeld lookups that equate to LIKE SQL statements (iexact,contains,icontains,startswith,

istartswith,endswith, and iendswith) will automatically escape the two special characters used in LIKE

statements – the percent sign and the underscore. (In a LIKE statement, the percent sign signiﬁes a multiple-character

wildcard and the underscore signiﬁes a single-character wildcard.)

This means things should work intuitively, so the abstraction doesn’t leak. For example, to retrieve all the entries that

contain a percent sign, just use the percent sign as any other character:

Entry.objects.filter(headline__contains=’%’)

Django takes care of the quoting for you. The resulting SQL will look something like this:

SELECT ... WHERE headline LIKE ’%\%%’;

The same goes for underscores. Both percentage signs and underscores are handled for you transparently.

24.6.4 icontains

Performs a case-insensitive containment test:

>>> Entry.objects.get(headline__icontains=’Lennon’)

Unlike contains,icontains will match ’today lennon honored’.

24.6.5 gt, gte, lt, and lte

These represent greater than, greater than or equal to, less than, and less than or equal to:

>>> Entry.objects.filter(id__gt=4)

>>> Entry.objects.filter(id__lt=15)

>>> Entry.objects.filter(id__gte=0)

These queries return any object with an ID greater than 4, an ID less than 15, and an ID greater than or equal to 1,

respectively.

You’ll usually use these on numeric ﬁelds. Be careful with character ﬁelds since character order isn’t always what

you’d expect (i.e., the string “4” sorts after the string “10”).

24.6. Field Lookups 323

The Django Book, Release 2.0

24.6.6 in

Filters where a value is on a given list:

Entry.objects.filter(id__in=[1,3,4])

This returns all objects with the ID 1, 3, or 4.

24.6.7 startswith

Performs a case-sensitive starts-with:

>>> Entry.objects.filter(headline__startswith=’Will’)

This will return the headlines “Will he run?” and “Willbur named judge”, but not “Who is Will?” or “will found in

crypt”.

24.6.8 istartswith

Performs a case-insensitive starts-with:

>>> Entry.objects.filter(headline__istartswith=’will’)

This will return the headlines “Will he run?”, “Willbur named judge”, and “will found in crypt”, but not “Who is

Will?”

24.6.9 endswith and iendswith

Perform case-sensitive and case-insensitive ends-with:

>>> Entry.objects.filter(headline__endswith=’cats’)

>>> Entry.objects.filter(headline__iendswith=’cats’)

Similar to startswith and istartswith.

24.6.10 range

Performs an inclusive range check:

>>> start_date =datetime.date(2005,1,1)

>>> end_date =datetime.date(2005,3,31)

>>> Entry.objects.filter(pub_date__range=(start_date, end_date))

You can use range anywhere you can use BETWEEN in SQL – for dates, numbers, and even characters.

24.6.11 year, month, and day

For date/datetime ﬁelds, perform exact year, month, or day matches:

# Return all entries published in 2005

>>>Entry.objects.filter(pub_date__year=2005)

# Return all entries published in December

324 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

>>> Entry.objects.filter(pub_date__month=12)

# Return all entries published on the 3rd of the month

>>> Entry.objects.filter(pub_date__day=3)

# Combination: return all entries on Christmas of any year

>>> Entry.objects.filter(pub_date__month=12, pub_date_day=25)

24.6.12 isnull

Takes either True or False, which correspond to SQL queries of IS NULL and IS NOT NULL, respectively:

>>> Entry.objects.filter(pub_date__isnull=True)

24.6.13 search

A Boolean full-text search that takes advantage of full-text indexing. This is like contains but is signiﬁcantly faster

due to full-text indexing.

Note this is available only in MySQL and requires direct manipulation of the database to add the full-text index.

24.6.14 The pk Lookup Shortcut

For convenience, Django provides a pk lookup type, which stands for “primary_key”.

In the example Blog model, the primary key is the id ﬁeld, so these three statements are equivalent:

>>> Blog.objects.get(id__exact=14)# Explicit form

>>> Blog.objects.get(id=14)# __exact is implied

>>> Blog.objects.get(pk=14)# pk implies id__exact

The use of pk isn’t limited to __exact queries – any query term can be combined with pk to perform a query on the

primary key of a model:

# Get blogs entries with id 1, 4, and 7

>>> Blog.objects.filter(pk__in=[1,4,7])

# Get all blog entries with id > 14

>>> Blog.objects.filter(pk__gt=14)

pk lookups also work across joins. For example, these three statements are equivalent:

>>> Entry.objects.filter(blog__id__exact=3)# Explicit form

>>> Entry.objects.filter(blog__id=3)# __exact is implied

>>> Entry.objects.filter(blog__pk=3)# __pk implies __id__exact

The point of pk is to give you a generic way to refer to the primary key in cases where you’re not sure whether the

model’s primary key is called id.

24.7 Complex Lookups with Q Objects

Keyword argument queries – in filter() and so on – are ANDed together. If you need to execute more complex

queries (e.g., queries with OR statements), you can use Qobjects.

24.7. Complex Lookups with Q Objects 325

The Django Book, Release 2.0

AQobject (django.db.models.Q) is an object used to encapsulate a collection of keyword arguments. These

keyword arguments are speciﬁed as in the “Field Lookups” section.

For example, this Qobject encapsulates a single LIKE query:

Q(question__startswith=’What’)

Qobjects can be combined using the &and |operators. When an operator is used on two Qobjects, it

yields a new Qobject. For example, this statement yields a single Qobject that represents the OR of two

"question__startswith" queries:

Q(question__startswith=’Who’)|Q(question__startswith=’What’)

This is equivalent to the following SQL WHERE clause:

WHERE question LIKE ’Who%’ OR question LIKE ’What%’

You can compose statements of arbitrary complexity by combining Qobjects with the &and |operators. You can also

use parenthetical grouping.

Each lookup function that takes keyword arguments (e.g., filter(),exclude(),get()) can also be passed

one or more Qobjects as positional (not-named) arguments. If you provide multiple Qobject arguments to a lookup

function, the arguments will be ANDed together, for example:

Poll.objects.get(

Q(question__startswith=’Who’),

Q(pub_date=date(2005,5,2)) |Q(pub_date=date(2005,5,6))

)

roughly translates into the following SQL:

SELECT *from polls WHERE question LIKE ’Who%’

AND (pub_date = ’2005-05-02’ OR pub_date = ’2005-05-06’)

Lookup functions can mix the use of Qobjects and keyword arguments. All arguments provided to a lookup function

(be they keyword arguments or Qobjects) are ANDed together. However, if a Qobject is provided, it must precede the

deﬁnition of any keyword arguments. For example, the following:

Poll.objects.get(

Q(pub_date=date(2005,5,2)) |Q(pub_date=date(2005,5,6)),

question__startswith=’Who’)

would be a valid query, equivalent to the previous example, but this:

# INVALID QUERY

Poll.objects.get(

question__startswith=’Who’,

Q(pub_date=date(2005,5,2)) |Q(pub_date=date(2005,5,6)))

would not be valid.

You can ﬁnd some examples online at http://www.djangoproject.com/documentation/models/or_lookups/.

24.8 Related Objects

When you deﬁne a relationship in a model (i.e., a ForeignKey,OneToOneField, or ManyToManyField),

instances of that model will have a convenient API to access the related object(s).

For example, an Entry object ecan get its associated Blog object by accessing the blog attribute e.blog.

326 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

Django also creates API accessors for the “other” side of the relationship – the link from the related model to the

model that deﬁnes the relationship. For example, a Blog object bhas access to a list of all related Entry objects via

the entry_set attribute: b.entry_set.all().

All examples in this section use the sample Blog,Author, and Entry models deﬁned at the start of the appendix.

24.8.1 Lookups That Span Relationships

Django offers a powerful and intuitive way to “follow” relationships in lookups, taking care of the SQL JOINs for

you automatically behind the scenes. To span a relationship, just use the ﬁeld name of related ﬁelds across models,

separated by double underscores, until you get to the ﬁeld you want.

This example retrieves all Entry objects with a Blog whose name is ’Beatles Blog’:

>>> Entry.objects.filter(blog__name__exact=’Beatles Blog’)

This spanning can be as deep as you’d like.

It works backward, too. To refer to a “reverse” relationship, just use the lowercase name of the model.

This example retrieves all Blog objects that have at least one Entry whose headline contains ’Lennon’:

>>> Blog.objects.filter(entry__headline__contains=’Lennon’)

24.8.2 Foreign Key Relationships

If a model has a ForeignKey, instances of that model will have access to the related (foreign) object via a simple

attribute of the model, for example:

e=Entry.objects.get(id=2)

e.blog # Returns the related Blog object.

You can get and set via a foreign key attribute. As you may expect, changes to the foreign key aren’t saved to the

database until you call save(), for example:

e=Entry.objects.get(id=2)

e.blog =some_blog

e.save()

If a ForeignKey ﬁeld has null=True set (i.e., it allows NULL values), you can set it to NULL by assigning None

to it and saving:

e=Entry.objects.get(id=2)

e.blog =None

e.save() # "UPDATE blog_entry SET blog_id = NULL ...;"

Forward access to one-to-many relationships is cached the ﬁrst time the related object is accessed. Subsequent accesses

to the foreign key on the same object instance are cached, for example:

e=Entry.objects.get(id=2)

print e.blog # Hits the database to retrieve the associated Blog.

print e.blog # Doesn’t hit the database; uses cached version.

Note that the select_related() QuerySet method recursively prepopulates the cache of all one-to-many rela-

tionships ahead of time:

e=Entry.objects.select_related().get(id=2)

print e.blog # Doesn’t hit the database; uses cached version.

24.8. Related Objects 327

The Django Book, Release 2.0

select_related() is documented in the “QuerySet Methods That Return New QuerySets” section.

24.8.3 “Reverse” Foreign Key Relationships

Foreign key relationships are automatically symmetrical – a reverse relationship is inferred from the presence of a

ForeignKey pointing to another model.

If a model has a ForeignKey, instances of the foreign key model will have access to a Manager that returns all

instances of the ﬁrst model that relate to that object. By default, this Manager is named FOO_set, where FOO is

the source model name, lowercased. This Manager returns QuerySets, which can be ﬁltered and manipulated as

described in the “Retrieving Objects” section.

Here’s an example:

b=Blog.objects.get(id=1)

b.entry_set.all() # Returns all Entry objects related to Blog.

# b.entry_set is a Manager that returns QuerySets.

b.entry_set.filter(headline__contains=’Lennon’)

b.entry_set.count()

You can override the FOO_set name by setting the related_name parameter in the ForeignKey()

deﬁnition. For example, if the Entry model was altered to blog = ForeignKey(Blog,

related_name=’entries’), the preceding example code would look like this:

b=Blog.objects.get(id=1)

b.entries.all() # Returns all Entry objects related to Blog.

# b.entries is a Manager that returns QuerySets.

b.entries.filter(headline__contains=’Lennon’)

b.entries.count()

related_name is particularly useful if a model has two foreign keys to the same second model.

You cannot access a reverse ForeignKey Manager from the class; it must be accessed from an instance:

Blog.entry_set # Raises AttributeError: "Manager must be accessed via instance".

In addition to the QuerySet methods deﬁned in the “Retrieving Objects” section, the ForeignKey Manager has

these additional methods:

•add(obj1, obj2, ...): Adds the speciﬁed model objects to the related object set, for example:

b=Blog.objects.get(id=1)

e=Entry.objects.get(id=234)

b.entry_set.add(e) # Associates Entry e with Blog b.

•create(**kwargs): Creates a new object, saves it, and puts it in the related object set. It returns the newly

created object:

b=Blog.objects.get(id=1)

e=b.entry_set.create(headline=’Hello’, body_text=’Hi’, pub_date=datetime.date(2005,1,1))

# No need to call e.save() at this point -- it’s already been saved.

This is equivalent to (but much simpler than) the following:

b=Blog.objects.get(id=1)

e=Entry(blog=b, headline=’Hello’, body_text=’Hi’, pub_date=datetime.date(2005,1,1))

e.save()

328 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

Note that there’s no need to specify the keyword argument of the model that deﬁnes the relationship. In the

preceding example, we don’t pass the parameter blog to create(). Django ﬁgures out that the new Entry

object’s blog ﬁeld should be set to b.

•remove(obj1, obj2, ...): Removes the speciﬁed model objects from the related object set:

b=Blog.objects.get(id=1)

e=Entry.objects.get(id=234)

b.entry_set.remove(e) # Disassociates Entry e from Blog b.

In order to prevent database inconsistency, this method only exists on ForeignKey objects where

null=True. If the related ﬁeld can’t be set to None (NULL), then an object can’t be removed from a relation

without being added to another. In the preceding example, removing efrom b.entry_set() is equivalent

to doing e.blog = None, and because the blog ForeignKey doesn’t have null=True, this is invalid.

•clear(): Removes all objects from the related object set:

b=Blog.objects.get(id=1)

b.entry_set.clear()

Note this doesn’t delete the related objects – it just disassociates them.

Just like remove(),clear() is only available on ForeignKey‘‘s where ‘‘null=True.

To assign the members of a related set in one fell swoop, just assign to it from any iterable object, for example:

b=Blog.objects.get(id=1)

b.entry_set =[e1, e2]

If the clear() method is available, any pre-existing objects will be removed from the entry_set before all

objects in the iterable (in this case, a list) are added to the set. If the clear() method is not available, all objects in

the iterable will be added without removing any existing elements.

Each “reverse” operation described in this section has an immediate effect on the database. Every addition, creation,

and deletion is immediately and automatically saved to the database.

24.8.4 Many-to-Many Relationships

Both ends of a many-to-many relationship get automatic API access to the other end. The API works just as a “reverse”

one-to-many relationship (described in the previous section).

The only difference is in the attribute naming: the model that deﬁnes the ManyToManyField uses the attribute name

of that ﬁeld itself, whereas the “reverse” model uses the lowercased model name of the original model, plus ’_set’

(just like reverse one-to-many relationships).

An example makes this concept easier to understand:

e=Entry.objects.get(id=3)

e.authors.all() # Returns all Author objects for this Entry.

e.authors.count()

e.authors.filter(name__contains=’John’)

a=Author.objects.get(id=5)

a.entry_set.all() # Returns all Entry objects for this Author.

Like ForeignKey,ManyToManyField can specify related_name. In the preceding example, if the

ManyToManyField in Entry had speciﬁed related_name=’entries’, then each Author instance would

have an entries attribute instead of entry_set.

How Are the Backward Relationships Possible?

24.8. Related Objects 329

The Django Book, Release 2.0

Other object-relational mappers require you to deﬁne relationships on both sides. The Django developers believe this

is a violation of the DRY (Don’t Repeat Yourself) principle, so Django requires you to deﬁne the relationship on only

one end. But how is this possible, given that a model class doesn’t know which other model classes are related to it

until those other model classes are loaded?

The answer lies in the INSTALLED_APPS setting. The ﬁrst time any model is loaded, Django iterates over every

model in INSTALLED_APPS and creates the backward relationships in memory as needed. Essentially, one of the

functions of INSTALLED_APPS is to tell Django the entire model domain.

24.8.5 Queries Over Related Objects

Queries involving related objects follow the same rules as queries involving normal value ﬁelds. When specifying the

value for a query to match, you may use either an object instance itself or the primary key value for the object.

For example, if you have a Blog object bwith id=5, the following three queries would be identical:

Entry.objects.filter(blog=b) # Query using object instance

Entry.objects.filter(blog=b.id) # Query using id from instance

Entry.objects.filter(blog=5)# Query using id directly

24.9 Deleting Objects

The delete method, conveniently, is named delete(). This method immediately deletes the object and has no return

value:

e.delete()

You can also delete objects in bulk. Every QuerySet has a delete() method, which deletes all members of that

QuerySet. For example, this deletes all Entry objects with a pub_date year of 2005:

Entry.objects.filter(pub_date__year=2005).delete()

When Django deletes an object, it emulates the behavior of the SQL constraint ON DELETE CASCADE – in other

words, any objects that had foreign keys pointing at the object to be deleted will be deleted along with it, for example:

b=Blog.objects.get(pk=1)

# This will delete the Blog and all of its Entry objects.

b.delete()

Note that delete() is the only QuerySet method that is not exposed on a Manager itself. This is a safety

mechanism to prevent you from accidentally requesting Entry.objects.delete() and deleting all the entries.

If you do want to delete all the objects, then you have to explicitly request a complete query set:

Entry.objects.all().delete()

24.10 Shortcuts

As you develop views, you will discover a number of common idioms in the way you use the database API. Django

encodes some of these idioms as shortcuts that can be used to simplify the process of writing views. These functions

are in the django.shortcuts module.

330 Chapter 24. Appendix B: Database API Reference

The Django Book, Release 2.0

24.10.1 get_object_or_404()

One common idiom to use get() and raise Http404 if the object doesn’t exist. This idiom is captured by

get_object_or_404(). This function takes a Django model as its ﬁrst argument and an arbitrary number of

keyword arguments, which it passes to the default manager’s get() function. It raises Http404 if the object doesn’t

exist, for example:

# Get the Entry with a primary key of 3

e=get_object_or_404(Entry, pk=3)

When you provide a model to this shortcut function, the default manager is used to execute the underlying get()

query. If you don’t want to use the default manager, or if you want to search a list of related objects, you can provide

get_object_or_404() with a Manager object instead:

# Get the author of blog instance e with a name of ’Fred’

a=get_object_or_404(e.authors, name=’Fred’)

# Use a custom manager ’recent_entries’ in the search for an

# entry with a primary key of 3

e=get_object_or_404(Entry.recent_entries, pk=3)

24.10.2 get_list_or_404()

get_list_or_404 behaves the same way as get_object_or_404(), except that it uses filter() instead

of get(). It raises Http404 if the list is empty.

24.11 Falling Back to Raw SQL

If you ﬁnd yourself needing to write an SQL query that is too complex for Django’s database mapper to handle, you

can fall back into raw SQL statement mode.

The preferred way to do this is by giving your model custom methods or custom manager methods that execute queries.

Although there’s nothing in Django that requires database queries to live in the model layer, this approach keeps all

your data access logic in one place, which is smart from a code organization standpoint. For instructions, see Appendix

Finally, it’s important to note that the Django database layer is merely an interface to your database. You can access

your database via other tools, programming languages, or database frameworks – there’s nothing Django-speciﬁc

about your database.

24.11. Falling Back to Raw SQL 331

The Django Book, Release 2.0

332 Chapter 24. Appendix B: Database API Reference

CHAPTER 25

Appendix C: Generic View Reference

Chapter 11 introduced generic views but leaves out some of the gory details. This appendix describes each generic

view along with all the options each view can take. Be sure to read Chapter 11 before trying to understand the reference

material that follows. You might want to refer back to the Book,Publisher, and Author objects deﬁned in that

chapter; the examples that follow use these models.

25.1 Common Arguments to Generic Views

Most of these views take a large number of arguments that can change the generic view’s behavior. Many of these

arguments work the same across a large number of views. Table C-1 describes each of these common arguments;

anytime you see one of these arguments in a generic view’s argument list, it will work as described in the table.

Table 25.1: Table C-1. Common Arguments to Generic Views

Argument Description

allow_empty A Boolean specifying whether to display the page if no objects are available. If this is False

and no objects are available, the view will raise a 404 error instead of displaying an empty

page. By default, this is True.

context_processorsA list of additional template-context processors (besides the defaults) to apply to the view’s

template. See Chapter 9 for information on template context processors.

extra_context A dictionary of values to add to the template context. By default, this is an empty dictionary.

If a value in the dictionary is callable, the generic view will call it just before rendering the

template.

mimetype The MIME type to use for the resulting document. It defaults to the value of the

DEFAULT_MIME_TYPE setting, which is text/html if you haven’t changed it.

queryset AQuerySet (i.e., something like Author.objects.all()) to read objects from. See

Appendix B for more information about QuerySet objects. Most generic views require this

argument.

template_loaderThe template loader to use when loading the template. By default, it’s

django.template.loader. See Chapter 9 for information on template loaders.

template_name The full name of a template to use in rendering the page. This lets you override the default

template name derived from the QuerySet.

template_object_nameThe name of the template variable to use in the template context. By default, this is

’object’. Views that list more than one object (i.e., object_list views and various

objects-for-date views) will append ’_list’ to the value of this parameter.

333

The Django Book, Release 2.0

25.2 “Simple” Generic Views

The module‘‘django.views.generic.simple‘‘ contains simple views that handle a couple of common cases: rendering a

template when no view logic is needed and issuing a redirect.

25.2.1 Rendering a Template

View function:django.views.generic.simple.direct_to_template

This view renders a given template, passing it a {{ params }} template variable, which is a dictionary of the

parameters captured in the URL.

Example

Given the following URLconf:

from django.conf.urls.defaults import *

from django.views.generic.simple import direct_to_template

urlpatterns =patterns(’’,

(r’^foo/$’, direct_to_template, {’template’:’foo_index.html’}),

(r’^foo/(?P<id>\d+)/$’, direct_to_template, {’template’:’foo_detail.html’}),

)

a request to /foo/ would render the template foo_index.html, and a request to /foo/15/ would render

foo_detail.html with a context variable {{ params.id }} that is set to 15.

Required Arguments

•template: The full name of a template to use.

25.2.2 Redirecting to Another URL

View function:django.views.generic.simple.redirect_to

This view redirects to another URL. The given URL may contain dictionary-style string formatting, which will be

interpolated against the parameters captured in the URL.

If the given URL is None, Django will return an HTTP 410 (“Gone”) message.

Example

This URLconf redirects from /foo/<id>/ to /bar/<id>/:

from django.conf.urls.defaults import *

from django.views.generic.simple import redirect_to

urlpatterns =patterns(’django.views.generic.simple’,

(’^foo/(?p<id>\d+)/$’, redirect_to, {’url’:’/bar/%(id)s/’}),

)

This example returns a “Gone” response for requests to /bar/:

334 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

from django.views.generic.simple import redirect_to

urlpatterns =patterns(’django.views.generic.simple’,

(’^bar/$’, redirect_to, {’url’:None}),

)

Required Arguments

•url: The URL to redirect to, as a string. Or None to return a 410 (“Gone”) HTTP response.

25.3 List/Detail Generic Views

The list/detail generic views (in the module django.views.generic.list_detail) handle the common case

of displaying a list of items at one view and individual “detail” views of those items at another.

25.3.1 Lists of Objects

View function:django.views.generic.list_detail.object_list

Use this view to display a page representing a list of objects.

Example

Given the Author object from Chapter 5, we can use the object_list view to show a simple list of all authors

given the following URLconf snippet:

from mysite.books.models import Author

from django.conf.urls.defaults import *

from django.views.generic import list_detail

author_list_info ={

’queryset’: Author.objects.all(),

}

urlpatterns =patterns(’’,

(r’authors/$’, list_detail.object_list, author_list_info)

)

Required Arguments

•queryset: A QuerySet of objects to list (see Table C-1).

Optional Arguments

•paginate_by: An integer specifying how many objects should be displayed per page. If this is given, the view

will paginate objects with paginate_by objects per page. The view will expect either a page query string

parameter (via GET) containing a zero-indexed page number, or a page variable speciﬁed in the URLconf. See

the following “Notes on Pagination” section.

Additionally, this view may take any of these common arguments described in Table C-1:

25.3. List/Detail Generic Views 335

The Django Book, Release 2.0

•allow_empty

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_list.html

by default. Both the application label and the model name are derived from the queryset parameter. The application

label is the name of the application that the model is deﬁned in, and the model name is the lowercased version of the

name of the model class.

In the previous example using Author.objects.all() as the queryset, the application label would be books

and the model name would be author. This means the default template would be books/author_list.html.

Template Context

In addition to extra_context, the template’s context will contain the following:

•object_list: The list of objects. This variable’s name depends on the template_object_name pa-

rameter, which is ’object’ by default. If template_object_name is ’foo’, this variable’s name will

be foo_list.

•is_paginated: A Boolean representing whether the results are paginated. Speciﬁcally, this is set to False

if the number of available objects is less than or equal to paginate_by.

If the results are paginated, the context will contain these extra variables:

•results_per_page: The number of objects per page. (This is the same as the paginate_by parameter.)

•has_next: A Boolean representing whether there’s a next page.

•has_previous: A Boolean representing whether there’s a previous page.

•page: The current page number, as an integer. This is 1-based.

•next: The next page number, as an integer. If there’s no next page, this will still be an integer representing the

theoretical next-page number. This is 1-based.

•previous: The previous page number, as an integer. This is 1-based.

•pages: The total number of pages, as an integer.

•hits: The total number of objects across all pages, not just this page.

A Note on Pagination

If paginate_by is speciﬁed, Django will paginate the results. You can specify the page number in the URL in one

of two ways:

• Use the page parameter in the URLconf. For example, this is what your URLconf might look like:

(r’^objects/page(?P<page>[0-9]+)/$’,’object_list’,dict(info_dict))

336 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

• Pass the page number via the page query-string parameter. For example, a URL would look like this:

/objects/?page=3

In both cases, page is 1-based, not 0-based, so the ﬁrst page would be represented as page 1.

25.3.2 Detail Views

View function:django.views.generic.list_detail.object_detail

This view provides a “detail” view of a single object.

Example

Continuing the previous object_list example, we could add a detail view for a given author by modifying the

URLconf:

from mysite.books.models import Author

from django.conf.urls.defaults import *

from django.views.generic import list_detail

author_list_info = {

‘queryset’ : Author.objects.all(),

}

author_detail_info = {

‘‘queryset’’ : Author.objects.all(),

‘‘template_object_name’’ : ‘‘author’’,

}

urlpatterns = patterns(‘’,

(r’authors/$’, list_detail.object_list, author_list_info),

(r’^authors/(?P<object_id>d+)/$’, list_detail.object_detail, author_detail_info),

)

Required Arguments

•queryset: A QuerySet that will be searched for the object (see Table C-1).

and either

•object_id: The value of the primary-key ﬁeld for the object.

•slug: The slug of the given object. If you pass this ﬁeld, then the slug_field argument (see the following

section) is also required.

Optional Arguments

•slug_field: The name of the ﬁeld on the object containing the slug. This is required if you are using the

slug argument, but it must be absent if you’re using the object_id argument.

25.3. List/Detail Generic Views 337

The Django Book, Release 2.0

•template_name_field: The name of a ﬁeld on the object whose value is the template name to use. This

lets you store template names in your data.

In other words, if your object has a ﬁeld ’the_template’ that contains a string ’foo.html’, and you set

template_name_field to ’the_template’, then the generic view for this object will use the template

’foo.html’.

If the template named by template_name_field doesn’t exist, the one named by template_name is

used instead. It’s a bit of a brain-bender, but it’s useful in some cases.

This view may also take these common arguments (see Table C-1):

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name and template_name_field aren’t speciﬁed, this view will use the template

<app_label>/<model_name>_detail.html by default.

Template Context

In addition to extra_context, the template’s context will be as follows:

•object: The object. This variable’s name depends on the template_object_name parameter, which is

’object’ by default. If template_object_name is ’foo’, this variable’s name will be foo.

25.4 Date-Based Generic Views

Date-based generic views are generally used to provide a set of “archive” pages for dated material. Think

year/month/day archives for a newspaper, or a typical blog archive.

Tip:

By default, these views ignore objects with dates in the future.

This means that if you try to visit an archive page in the future, Django will automatically show a 404 (“Page not

found”) error, even if there are objects published that day.

Thus, you can publish postdated objects that don’t appear publicly until their desired publication date.

However, for different types of date-based objects, this isn’t appropriate (e.g., a calendar of upcoming events). For

these views, setting the allow_future option to True will make the future objects appear (and allow users to visit

“future” archive pages).

338 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

25.4.1 Archive Index

View function:django.views.generic.date_based.archive_index

This view provides a top-level index page showing the “latest” (i.e., most recent) objects by date.

Example

Say a typical book publisher wants a page of recently published books. Given some Book object with a

publication_date ﬁeld, we can use the archive_index view for this common task:

from mysite.books.models import Book

from django.conf.urls.defaults import *

from django.views.generic import date_based

book_info ={

"queryset" : Book.objects.all(),

"date_field" :"publication_date"

}

urlpatterns =patterns(’’,

(r’^books/$’, date_based.archive_index, book_info),

)

Required Arguments

•date_field: The name of the DateField or DateTimeField in the QuerySet‘s model that the date-

based archive should use to determine the objects on the page.

•queryset: A QuerySet of objects for which the archive serves.

Optional Arguments

•allow_future: A Boolean specifying whether to include “future” objects on this page, as described in the

previous note.

•num_latest: The number of latest objects to send to the template context. By default, it’s 15.

This view may also take these common arguments (see Table C-1):

•allow_empty

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_archive.html

by default.

25.4. Date-Based Generic Views 339

The Django Book, Release 2.0

Template Context

In addition to extra_context, the template’s context will be as follows:

•date_list: A list of datetime.date objects representing all years that have objects available according

to queryset. These are ordered in reverse.

For example, if you have blog entries from 2003 through 2006, this list will contain four datetime.date

objects: one for each of those years.

•latest: The num_latest objects in the system, in descending order by date_field. For example, if

num_latest is 10, then latest will be a list of the latest ten objects in queryset.

25.4.2 Year Archives

View function:django.views.generic.date_based.archive_year

Use this view for yearly archive pages. These pages have a list of months in which objects exists, and they can

optionally display all the objects published in a given year.

Example

Extending the archive_index example from earlier, we’ll add a way to view all the books published in a given

year:

from mysite.books.models import Book

from django.conf.urls.defaults import *

from django.views.generic import date_based

book_info = {

‘‘queryset’’ : Book.objects.all(),

‘‘date_field’’ : ‘‘publication_date’’

}

urlpatterns = patterns(‘’,

(r’^books/$’, date_based.archive_index, book_info),

(r’^books/(?P<year>d{4})/?$’, date_based.archive_year, book_info),

)

Required Arguments

•date_field: As for archive_index (see the previous section).

•queryset: A QuerySet of objects for which the archive serves.

•year: The four-digit year for which the archive serves (as in our example, this is usually taken from a URL

parameter).

Optional Arguments

•make_object_list: A Boolean specifying whether to retrieve the full list of objects for this year and pass

those to the template. If True, this list of objects will be made available to the template as object_list.

(The name object_list may be different; see the information about object_list in the following “Tem-

plate Context” section.) By default, this is False.

340 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

•allow_future: A Boolean specifying whether to include “future” objects on this page.

This view may also take these common arguments (see Table C-1):

•allow_empty

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_archive_year.html

by default.

Template Context

In addition to extra_context, the template’s context will be as follows:

•date_list: A list of datetime.date objects representing all months that have objects available in the

given year, according to queryset, in ascending order.

•year: The given year, as a four-character string.

•object_list: If the make_object_list parameter is True, this will be set to a list of objects available

for the given year, ordered by the date ﬁeld. This variable’s name depends on the template_object_name

parameter, which is ’object’ by default. If template_object_name is ’foo’, this variable’s name

will be foo_list.

If make_object_list is False,object_list will be passed to the template as an empty list.

25.4.3 Month Archives

View function:django.views.generic.date_based.archive_month

This view provides monthly archive pages showing all objects for a given month.

Example

Continuing with our example, adding month views should look familiar:

urlpatterns = patterns(‘’,

(r’^books/$’, date_based.archive_index, book_info),

(r’^books/(?P<year>d{4})/?$’, date_based.archive_year, book_info),

(

r’^(?P<year>d{4})/(?P<month>[a-z]{3})/$’,

date_based.archive_month,

book_info

)

25.4. Date-Based Generic Views 341

The Django Book, Release 2.0

Required Arguments

•year: The four-digit year for which the archive serves (a string).

•month: The month for which the archive serves, formatted according to the month_format argument.

•queryset: A QuerySet of objects for which the archive serves.

•date_field: The name of the DateField or DateTimeField in the QuerySet‘s model that the date-

based archive should use to determine the objects on the page.

Optional Arguments

•month_format: A format string that regulates what format the month parameter uses. This should

be in the syntax accepted by Python’s time.strftime. (See Python’s strftime documentation at

http://docs.python.org/library/time.html#time.strftime.) It’s set to "%b" by default, which is a three-letter month

abbreviation (i.e., “jan”, “feb”, etc.). To change it to use numbers, use "%m".

•allow_future: A Boolean specifying whether to include “future” objects on this page, as described in the

previous note.

This view may also take these common arguments (see Table C-1):

•allow_empty

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_archive_month.html

by default.

Template Context

In addition to extra_context, the template’s context will be as follows:

•month: A datetime.date object representing the given month.

•next_month: A datetime.date object representing the ﬁrst day of the next month. If the next month is

in the future, this will be None.

•previous_month: A datetime.date object representing the ﬁrst day of the previous month. Unlike

next_month, this will never be None.

•object_list: A list of objects available for the given month. This variable’s name depends on the

template_object_name parameter, which is ’object’ by default. If template_object_name is

’foo’, this variable’s name will be foo_list.

342 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

25.4.4 Week Archives

View function:django.views.generic.date_based.archive_week

This view shows all objects in a given week.

Note: For the sake of consistency with Python’s built-in date/time handling, Django assumes that the ﬁrst day of the

week is Sunday.

Example

urlpatterns = patterns(‘’,

# ...

(

r’^(?P<year>d{4})/(?P<week>d{2})/$’,

date_based.archive_week,

book_info

)

Required Arguments

•year: The four-digit year for which the archive serves (a string).

•week: The week of the year for which the archive serves (a string).

•queryset: A QuerySet of objects for which the archive serves.

•date_field: The name of the DateField or DateTimeField in the QuerySet‘s model that the date-

based archive should use to determine the objects on the page.

Optional Arguments

•allow_future: A Boolean specifying whether to include “future” objects on this page, as described in the

previous note.

This view may also take these common arguments (see Table C-1):

•allow_empty

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_archive_week.html

by default.

25.4. Date-Based Generic Views 343

The Django Book, Release 2.0

Template Context

In addition to extra_context, the template’s context will be as follows:

•week: A datetime.date object representing the ﬁrst day of the given week.

•object_list: A list of objects available for the given week. This variable’s name depends on the

template_object_name parameter, which is ’object’ by default. If template_object_name is

’foo’, this variable’s name will be foo_list.

25.4.5 Day Archives

View function:django.views.generic.date_based.archive_day

This view generates all objects in a given day.

Example

urlpatterns = patterns(‘’,

# ...

(

r’^(?P<year>d{4})/(?P<month>[a-z]{3})/(?P<day>d{2})/$’,

date_based.archive_day,

book_info

)

Required Arguments

•year: The four-digit year for which the archive serves (a string).

•month: The month for which the archive serves, formatted according to the month_format argument.

•day: The day for which the archive serves, formatted according to the day_format argument.

•queryset: A QuerySet of objects for which the archive serves.

•date_field: The name of the DateField or DateTimeField in the QuerySet‘s model that the date-

based archive should use to determine the objects on the page.

Optional Arguments

•month_format: A format string that regulates what format the month parameter uses. See the detailed

explanation in the “Month Archives” section, above.

•day_format: Like month_format, but for the day parameter. It defaults to "%d" (the day of the month

as a decimal number, 01-31).

•allow_future: A Boolean specifying whether to include “future” objects on this page, as described in the

previous note.

This view may also take these common arguments (see Table C-1):

•allow_empty

•context_processors

344 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name isn’t speciﬁed, this view will use the template <app_label>/<model_name>_archive_day.html

by default.

Template Context

In addition to extra_context, the template’s context will be as follows:

•day: A datetime.date object representing the given day.

•next_day: A datetime.date object representing the next day. If the next day is in the future, this will be

None.

•previous_day: A datetime.date object representing the previous day. Unlike next_day, this will

never be None.

•object_list: A list of objects available for the given day. This variable’s name depends on the

template_object_name parameter, which is ’object’ by default. If template_object_name is

’foo’, this variable’s name will be foo_list.

25.4.6 Archive for Today

The django.views.generic.date_based.archive_today view shows all objects for today. This is ex-

actly the same as archive_day, except the year/month/day arguments are not used, and today’s date is used

instead.

Example

urlpatterns = patterns(‘’,

# ...

(r’^books/today/$’, date_based.archive_today, book_info),

)

25.4.7 Date-Based Detail Pages

View function:django.views.generic.date_based.object_detail

Use this view for a page representing an individual object.

This has a different URL from the object_detail view; the object_detail view uses URLs like

/entries/<slug>/, while this one uses URLs like /entries/2006/aug/27/<slug>/.

Note: If you’re using date-based detail pages with slugs in the URLs, you probably also want to use the

25.4. Date-Based Generic Views 345

The Django Book, Release 2.0

unique_for_date option on the slug ﬁeld to validate that slugs aren’t duplicated in a single day. See Appendix A

for details on unique_for_date.

Example

This one differs (slightly) from all the other date-based examples in that we need to provide either an object ID or a

slug so that Django can look up the object in question.

Since the object we’re using doesn’t have a slug ﬁeld, we’ll use ID-based URLs. It’s considered a best practice to use

a slug ﬁeld, but in the interest of simplicity we’ll let it go.

urlpatterns = patterns(‘’,

# ...

(

r’^(?P<year>d{4})/(?P<month>[a-z]{3})/(?P<day>d{2})/(?P<object_id>[w-]+)/$’,

date_based.object_detail,

book_info

)

Required Arguments

•year: The object’s four-digit year (a string).

•month: The object’s month, formatted according to the month_format argument.

•day: The object’s day, formatted according to the day_format argument.

•queryset: A QuerySet that contains the object.

•date_field: The name of the DateField or DateTimeField in the QuerySet‘s model that the

generic view should use to look up the object according to year,month, and day.

You’ll also need either:

•object_id: The value of the primary-key ﬁeld for the object.

or:

•slug: The slug of the given object. If you pass this ﬁeld, then the slug_field argument (described in the

following section) is also required.

Optional Arguments

•allow_future: A Boolean specifying whether to include “future” objects on this page, as described in the

previous note.

•day_format: Like month_format, but for the day parameter. It defaults to "%d" (the day of the month

as a decimal number, 01-31).

•month_format: A format string that regulates what format the month parameter uses. See the detailed

explanation in the “Month Archives” section, above.

•slug_field: The name of the ﬁeld on the object containing the slug. This is required if you are using the

slug argument, but it must be absent if you’re using the object_id argument.

346 Chapter 25. Appendix C: Generic View Reference

The Django Book, Release 2.0

•template_name_field: The name of a ﬁeld on the object whose value is the template name to use. This

lets you store template names in the data. In other words, if your object has a ﬁeld ’the_template’ that

contains a string ’foo.html’, and you set template_name_field to ’the_template’, then the

generic view for this object will use the template ’foo.html’.

This view may also take these common arguments (see Table C-1):

•context_processors

•extra_context

•mimetype

•template_loader

•template_name

•template_object_name

Template Name

If template_name and template_name_field aren’t speciﬁed, this view will use the template

<app_label>/<model_name>_detail.html by default.

Template Context

In addition to extra_context, the template’s context will be as follows:

•object: The object. This variable’s name depends on the template_object_name parameter, which is

’object’ by default. If template_object_name is ’foo’, this variable’s name will be foo.

25.4. Date-Based Generic Views 347

The Django Book, Release 2.0

348 Chapter 25. Appendix C: Generic View Reference

CHAPTER 26

Appendix D: Settings

Your Django settings ﬁle contains all the conﬁguration of your Django installation. This appendix explains how

settings work and which settings are available.

26.1 What’s a Settings File?

Asettings ﬁle is just a Python module with module-level variables.

Here are a couple of example settings:

DEBUG =False

DEFAULT_FROM_EMAIL =’webmaster@example.com’

TEMPLATE_DIRS =(’/home/templates/mike’,’/home/templates/john’)

Because a settings ﬁle is a Python module, the following apply:

• It must be valid Python code; syntax errors aren’t allowed.

• It can assign settings dynamically using normal Python syntax, for example:

MY_SETTING =[str(i) for iin range(30)]

• It can import values from other settings ﬁles.

26.1.1 Default Settings

A Django settings ﬁle doesn’t have to deﬁne any settings if it doesn’t need to. Each setting has a sensible default value.

These defaults live in the ﬁle django/conf/global_settings.py.

Here’s the algorithm Django uses in compiling settings:

• Load settings from global_settings.py.

• Load settings from the speciﬁed settings ﬁle, overriding the global settings as necessary.

Note that a settings ﬁle should not import from global_settings, because that’s redundant.

26.1.2 Seeing Which Settings You’ve Changed

There’s an easy way to view which of your settings deviate from the default settings. The command manage.py

diffsettings displays differences between the current settings ﬁle and Django’s default settings.

manage.py is described in more detail in Appendix F.

349

The Django Book, Release 2.0

26.1.3 Using Settings in Python Code

In your Django applications, use settings by importing the object django.conf.settings, for example:

from django.conf import settings

if settings.DEBUG:

# Do something

Note that django.conf.settings isn’t a module – it’s an object. So importing individual settings is not possible:

from django.conf.settings import DEBUG # This won’t work.

Also note that your code should not import from either global_settings or your own settings ﬁle.

django.conf.settings abstracts the concepts of default settings and site-speciﬁc settings; it presents a single

interface. It also decouples the code that uses settings from the location of your settings.

26.1.4 Altering Settings at Runtime

You shouldn’t alter settings in your applications at runtime. For example, don’t do this in a view:

from django.conf import settings

settings.DEBUG =True # Don’t do this!

The only place that settings should be deﬁned in is a settings ﬁle.

26.1.5 Security

Because a settings ﬁle contains sensitive information, such as the database password, you should make every attempt

to limit access to it. For example, change its ﬁle permissions so that only you and your Web server’s user can read it.

This is especially important in a shared-hosting environment.

26.1.6 Creating Your Own Settings

There’s nothing stopping you from creating your own settings, for your own Django applications. Just follow these

conventions:

• Use all uppercase for setting names.

• For settings that are sequences, use tuples instead of lists. Settings should be considered immutable and

shouldn’t be changed once they’re deﬁned. Using tuples mirrors these semantics.

• Don’t reinvent an already existing setting.

26.2 Designating the Settings: DJANGO_SETTINGS_MODULE

When you use Django, you have to tell it which settings you’re using. Do this by using the environment variable

DJANGO_SETTINGS_MODULE.

The value of DJANGO_SETTINGS_MODULE should be in Python path syntax (e.g., mysite.settings). Note

that the settings module should be on the Python import search path (PYTHONPATH).

Tip:

350 Chapter 26. Appendix D: Settings

The Django Book, Release 2.0

A good guide to PYTHONPATH can be found at http://diveintopython.org/getting_to_know_python/everything_is_an_object.html.

26.2.1 The django-admin.py Utility

When using django-admin.py (see Appendix F), you can either set the environment variable once or explicitly

pass in the settings module each time you run the utility.

Here’s an example using the Unix Bash shell:

export DJANGO_SETTINGS_MODULE=mysite.settings

django-admin.py runserver

Here’s an example using the Windows shell:

set DJANGO_SETTINGS_MODULE=mysite.settings

django-admin.py runserver

Use the --settings command-line argument to specify the settings manually:

django-admin.py runserver --settings=mysite.settings

The manage.py utility created by startproject as part of the project skeleton sets

DJANGO_SETTINGS_MODULE automatically; see Appendix F for more about manage.py.

26.2.2 On the Server (mod_python)

In your live server environment, you’ll need to tell Apache/mod_python which settings ﬁle to use. Do that with

SetEnv:

SetHandler python-program

PythonHandler django.core.handlers.modpython

SetEnv DJANGO_SETTINGS_MODULE mysite.settings

</Location>

For more information, read the Django mod_python documentation online at

http://docs.djangoproject.com/en/dev/howto/deployment/modpython/.

26.3 Using Settings Without Setting DJANGO_SETTINGS_MODULE

In some cases, you might want to bypass the DJANGO_SETTINGS_MODULE environment variable. For example, if

you’re using the template system by itself, you likely don’t want to have to set up an environment variable pointing to

a settings module.

In these cases, you can conﬁgure Django’s settings manually. Do this by calling

django.conf.settings.configure(). Here’s an example:

from django.conf import settings

settings.configure(

DEBUG =True,

TEMPLATE_DEBUG =True,

TEMPLATE_DIRS =[

’/home/web-apps/myapp’,

26.3. Using Settings Without Setting DJANGO_SETTINGS_MODULE 351

The Django Book, Release 2.0

’/home/web-apps/base’,

]

)

Pass configure() as many keyword arguments as you’d like, with each keyword argument representing a setting

and its value. Each argument name should be all uppercase, with the same name as the settings described earlier. If a

particular setting is not passed to configure() and is needed at some later point, Django will use the default setting

value.

Conﬁguring Django in this fashion is mostly necessary – and, indeed, recommended – when you’re using a piece of

the framework inside a larger application.

Consequently, when conﬁgured via settings.configure(), Django will not make any modiﬁcations to the

process environment variables. (See the explanation of TIME_ZONE later in this appendix for why this would normally

occur.) It’s assumed that you’re already in full control of your environment in these cases.

26.3.1 Custom Default Settings

If you’d like default values to come from somewhere other than django.conf.global_settings, you can pass

in a module or class that provides the default settings as the default_settings argument (or as the ﬁrst positional

argument) in the call to configure().

In this example, default settings are taken from myapp_defaults, and the DEBUG setting is set to True, regardless

of its value in myapp_defaults:

from django.conf import settings

from myapp import myapp_defaults

settings.configure(default_settings=myapp_defaults, DEBUG=True)

The following example, which uses myapp_defaults as a positional argument, is equivalent:

settings.configure(myapp_defaults, DEBUG =True)

Normally, you will not need to override the defaults in this fashion. The Django defaults are sufﬁciently tame that you

can safely use them. Be aware that if you do pass in a new default module, it entirely replaces the Django defaults,

so you must specify a value for every possible setting that might be used in that code you are importing. Check in

django.conf.settings.global_settings for the full list.

26.3.2 Either conﬁgure() or DJANGO_SETTINGS_MODULE Is Required

If you’re not setting the DJANGO_SETTINGS_MODULE environment variable, you must call configure() at some

point before using any code that reads settings.

If you don’t set DJANGO_SETTINGS_MODULE and don’t call configure(), Django will raise an

EnvironmentError exception the ﬁrst time a setting is accessed.

If you set DJANGO_SETTINGS_MODULE, access settings values somehow, and then call configure(), Django

will raise an EnvironmentError stating that settings have already been conﬁgured.

Also, it’s an error to call configure() more than once, or to call configure() after any setting has been

accessed.

It boils down to this: use exactly one of either configure() or DJANGO_SETTINGS_MODULE, and only once.

352 Chapter 26. Appendix D: Settings

The Django Book, Release 2.0

26.4 Available Settings

The following sections consist of a list of the main available settings, in alphabetical order, and their default values.

26.4.1 ABSOLUTE_URL_OVERRIDES

Default:{} (empty dictionary)

This is a dictionary mapping "app_label.model_name" strings to functions that take a model object and return

its URL. This is a way of overriding get_absolute_url() methods on a per-installation basis. Here’s an example:

ABSOLUTE_URL_OVERRIDES ={

’blogs.weblog’:lambda o: "/blogs/%s/" %o.slug,

’news.story’:lambda o: "/stories/%s/%s/" %(o.pub_year, o.slug),

}

Note that the model name used in this setting should be all lowercase, regardless of the case of the actual model class

name.

26.4.2 ADMIN_MEDIA_PREFIX

Default:’/media/’

This setting is the URL preﬁx for admin media: CSS, JavaScript, and images. Make sure to use a trailing slash.

26.4.3 ADMINS

Default:() (empty tuple)

This is a tuple that lists people who get code error notiﬁcations. When DEBUG=False and a view raises an exception,

Django will email these people with the full exception information. Each member of the tuple should be a tuple of

(Full name, e-mail address), for example:

((’John’,’john@example.com’), (’Mary’,’mary@example.com’))

Note that Django will email all of these people whenever an error happens.

26.4.4 ALLOWED_INCLUDE_ROOTS

Default:() (empty tuple)

This is a tuple of strings representing allowed preﬁxes for the {% ssi %} template tag. This is a security measure,

so that template authors can’t access ﬁles that they shouldn’t be accessing.

For example, if ALLOWED_INCLUDE_ROOTS is (’/home/html’, ’/var/www’), then {% ssi

/home/html/foo.txt %} would work, but {% ssi /etc/passwd %} wouldn’t.

26.4.5 APPEND_SLASH

Default:True

This setting indicates whether to append trailing slashes to URLs. This is used only if CommonMiddleware is

installed (see Chapter 17). See also PREPEND_WWW.

26.4. Available Settings 353

The Django Book, Release 2.0

26.4.6 CACHE_BACKEND

Default:’locmem://’

This is the cache back-end to use (see Chapter 15).

26.4.7 CACHE_MIDDLEWARE_KEY_PREFIX

Default:’’ (empty string)

This is the cache key preﬁx that the cache middleware should use (see Chapter 15).

26.4.8 DATABASE_ENGINE

Default:’’ (empty string)

This setting indicates which database back-end to use, e.g. ’postgresql_psycopg2’, or ’mysql’.

26.4.9 DATABASE_HOST

Default:’’ (empty string)

This setting indicates which host to use when connecting to the database. An empty string means localhost. This

is not used with SQLite.

If this value starts with a forward slash (’/’) and you’re using MySQL, MySQL will connect via a Unix socket to the

speciﬁed socket:

DATABASE_HOST =’/var/run/mysql’

If you’re using MySQL and this value doesn’t start with a forward slash, then this value is assumed to be the host.

26.4.10 DATABASE_NAME

Default:’’ (empty string)

This is the name of the database to use. For SQLite, it’s the full path to the database ﬁle.

26.4.11 DATABASE_OPTIONS

Default:{} (empty dictionary)

This is extra parameters to use when connecting to the database. Consult the back-end module’s document for available

keywords.

26.4.12 DATABASE_PASSWORD

Default:’’ (empty string)

This setting is the password to use when connecting to the database. It is not used with SQLite.

354 Chapter 26. Appendix D: Settings

The Django Book, Release 2.0

26.4.13 DATABASE_PORT

Default:’’ (empty string)

This is the port to use when connecting to the database. An empty string means the default port. It is not used with

SQLite.

26.4.14 DATABASE_USER

Default:’’ (empty string)

This setting is the username to use when connecting to the database. It is not used with SQLite.

26.4.15 DATE_FORMAT

Default:’N j, Y’ (e.g., Feb. 4, 2003)

This is the default formatting to use for date ﬁelds on Django admin change-list pages – and, possibly, by other parts

of the system. It accepts the same format as the now tag (see Appendix E, Table E-2).

See also DATETIME_FORMAT,TIME_FORMAT,YEAR_MONTH_FORMAT, and MONTH_DAY_FORMAT.

26.4.16 DATETIME_FORMAT

Default:’N j, Y, P’ (e.g., Feb. 4, 2003, 4 p.m.)

This is the default formatting to use for datetime ﬁelds on Django admin change-list pages – and, possibly, by other

parts of the system. It accepts the same format as the now tag (see Appendix E, Table E-2).

See also DATE_FORMAT,DATETIME_FORMAT,TIME_FORMAT,YEAR_MONTH_FORMAT, and

MONTH_DAY_FORMAT.

26.4.17 DEBUG

Default:False

This setting is a Boolean that turns debug mode on and off.

If you deﬁne custom settings, django/views/debug.py has a HIDDEN_SETTINGS regular expression that will

hide from the DEBUG view anything that contains ’SECRET,PASSWORD, or PROFANITIES’. This allows untrusted

users to be able to give backtraces without seeing sensitive (or offensive) settings.

Still, note that there are always going to be sections of your debug output that are inappropriate for public consumption.

File paths, conﬁguration options, and the like all give attackers extra information about your server. Never deploy a

site with DEBUG turned on.

26.4.18 DEFAULT_CHARSET

Default:’utf-8’

This is the default charset to use for all HttpResponse objects, if a MIME type isn’t manually speciﬁed. It is

used with DEFAULT_CONTENT_TYPE to construct the Content-Type header. See Appendix G for more about

HttpResponse objects.

26.4. Available Settings 355

The Django Book, Release 2.0

26.4.19 DEFAULT_CONTENT_TYPE

Default:’text/html’

This is the default content type to use for all HttpResponse objects, if a MIME type isn’t manually speciﬁed.

It is used with DEFAULT_CHARSET to construct the Content-Type header. See Appendix G for more about

HttpResponse objects.

26.4.20 DEFAULT_FROM_EMAIL

Default:’webmaster@localhost’

This is the default email address to use for various automated correspondence from the site manager(s).

26.4.21 DISALLOWED_USER_AGENTS

Default:() (empty tuple)

This is a list of compiled regular expression objects representing User-Agent strings that are not allowed to visit any

page, systemwide. Use this for bad robots/crawlers. This is used only if CommonMiddleware is installed (see

Chapter 17).

26.4.22 EMAIL_HOST

Default:’localhost’

This is the host to use for sending email. See also EMAIL_PORT.

26.4.23 EMAIL_HOST_PASSWORD

Default:’’ (empty string)

This is the password to use for the SMTP server deﬁned in EMAIL_HOST. This setting is used in conjunction with

EMAIL_HOST_USER when authenticating to the SMTP server. If either of these settings is empty, Django won’t

attempt authentication.

The Django Book User Manual

Navigation menu

Versions of this User Manual:

Views

Navigation