📋 Table of Contents
🚀 Getting Started
System Requirements
Installation & Setup
1. Backend Setup
2. Port Mapping (Required)
3. Frontend Setup
http://localhost:3000
in your browser.
You should see the ConceptViz interface load successfully.
📖 Complete Tutorial: Exploring Superhero Features
Follow this comprehensive example to explore superhero-related concepts in Large Language Models using ConceptViz's Identification → Interpretation → Validation workflow. This tutorial demonstrates how to discover, analyze, and validate meaningful concept representations in LLMs.
🎮 Interactive Demo
Experience the complete workflow hands-on with our live demonstration:
🚀 Launch Interactive Demo💡 Tip: Open the demo in a separate tab and follow along with the tutorial steps below to see each visualization fill in progressively!
🔍 Concept Query View - Define Your Concept
What you'll do: Start by querying "superhero" to explore superhero-related features in the model.
🎮 Try it: Launch the demo and enter "superhero" in the query box to begin.
- Enter your concept: Type "superhero" in the query input field
- Observe optimization: The system automatically suggests query improvements based on semantic analysis
- Model selection: Use the dropdown menu to switch between different base models if needed

🎯 SAE Discovery View - Find Relevant Models
What you'll do: Select the most relevant SAE model based on concept-relevance metrics across different layers.
🎮 Try it: In the demo, examine the layer rankings and click on layer 11 to load the corresponding SAE.
- Review layer rankings: Examine the heat map showing concept relevance across all network layers
- Identify optimal layers: Look for layers with the highest relevance scores (darker blue bars)
- Select SAE model: Click on the interested layer to load the corresponding SAE
- Proceed to exploration: Move to feature exploration once the SAE is loaded
🗺️ Feature Explorer View - Navigate the Semantic Space
What you'll do: Explore the 2D semantic space to identify superhero-related features and understand their clustering patterns.
🎮 Try it: In the demo, navigate the 2D visualization and use the sidebar to explore features ranked by similarity to your superhero query, and select the top feature 6610.
- Navigate the visualization: Pan and zoom to explore different semantic regions
- Identify concept clusters: Look for areas where superhero-relevant features (blue points) concentrate
- Select features: Click on individual feature points to examine their details
- Use semantic labels: Reference cluster topic labels like "individuals", "power", or "spiritual"
- Expand sidebar: Open the left sidebar to see features ranked by similarity to your query
- Scroll through rankings: Browse features ordered by relevance scores
- Observe highlighting: Notice how the main view highlights the currently focused feature in red
🔬 Feature Details View - Deep Dive Analysis
What you'll do: Examine detailed semantic information about selected superhero-related features.
🎮 Try it: In the demo, click on a feature point to open the detailed analysis panel and examine vocabulary projections and activation patterns. Don't forget the Button 'VALIDATE'.
- Read feature explanations: Review the automated description (e.g., "references to superhero characters and their narratives")
- Examine vocabulary space: Check which tokens this feature most strongly promotes or suppresses in the model's predictions
- Analyze activation patterns: Use the activation-similarity matrix to identify potential explanation discrepancies
- Review token statistics: Study the maximum activation tokens to understand feature behavior
- Identify anomalies: Look for high-activation samples with low semantic similarity to the explanation
⚡ Input Activation View - Test with Custom Inputs
What you'll do: Validate feature behavior by testing with custom superhero-related text inputs.
🎮 Try it: In the demo, test custom superhero inputs to see token activations and co-activating features (Unfortunately, the data has been preset, please feel free to enter).
- Enter test sentences: Type superhero-related text like "My favorite hero is Batman"
- Observe token activations: See which words trigger the strongest feature responses
- Select interesting tokens: Click on tokens with high activation (e.g., "Superman", "Batman")
- Discover co-activating features: System highlights other features that respond to selected tokens
- Switch between features: Use the test history dropdown to compare different feature responses


🎛️ Output Steering View - Causal Validation
What you'll do: Verify causal relationships by manipulating feature activations and observing their impact on model outputs (Sorry, we also have pre-made data here).
🎮 Try it: In the demo, set up steering prompts and adjust activation sliders to observe how feature manipulation affects text generation.
- Set up steering prompt: Enter an incomplete sentence like "My favorite hero is"
- Configure steering strength: Adjust sliders to set different activation levels (positive/negative)
- Generate completions: Observe how different steering strengths affect the output
- Compare results: Analyze differences between steered and unsteered generations
- Validate causality: Confirm that steering toward superhero features produces superhero-related completions
- Identified superhero-related features through concept querying and SAE discovery
- Interpreted feature semantics through spatial exploration and detailed analysis
- Validated feature behavior through custom input testing and causal steering
🆘 Troubleshooting
Common Issues
❌ Frontend can't connect to backend
- Check if backend is running on port 5000
- Verify port mapping:
ssh -L 6006:localhost:5000 localhost
- Check firewall settings
- Try accessing
http://localhost:6006/api/health
directly
🖥️ Display issues (layout problems)
- Ensure you're using a 2K display (2560×1440)
- Set browser zoom to 100%
- Try fullscreen mode (F11)
- Clear browser cache and reload